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Description 

TECHNICAL FIELD 

5 The invention relates to transgenic non-human animals capable of producing heterologous antibodies, transgenes 
used to produce such transgenic animals, immortalized B-cells capable of producing heterologous antibodies, methods 
and vectors for disrupting endogenous immunoglobin loci, methods to generate a synthetic immunoglobulin variable 
region gene segment repertoire, and methods to induce heterologous antibody production. 

10 BACKGROUND OF THE INVENTION 

One of the major impediments facing the development of in vivo applications for monoclonal antibodies in humans 
is the intrinsic immunogenicity of non-human immunoglobulins. Patients respond to therapeutic doses of rodent mono- 
clonal antibodies by making antibodies against the rodent immunoglobulin sequences. These human anti-mouse anti- 

is bodies (HAMA) neutralize the therapeutic antibodies and can cause acute toxicity. The HAMA response is less dramatic 
in immunodeficient patients. Therefore, intrinsic immunogenicity has not prevented the use of rodent monoclonal anti- 
bodies for the treatment of graft rejection, which involves the temporary attenuation of the patient's immune response. 
Rodent antibodies may also be useful for treating certain lymphomas that involve immunodeficiencies. However, even 
immunodeficient patients can mount a HAMA response which leads to a reduction in safety and efficacy. 

20 The present technology for generating monoclonal antibodies involves pre-exposing, or priming, an animal (usually 
a rat or mouse) with antigen. This pre-exposure leads to the formation of splenic B-cells that secrete immunoglobulin 
molecules with high affinity for the antigen. Spleen cells from a primed animal are then fused with myxoma cells to form 
immortal, antibody secreting, hybridoma cells. Individual hybridoma clones are screened to identify those cells produc- 
ing immunoglobulins directed against a particular antigen. 

25 The genetic engineering of individual antibody genes has been proposed. Two genetic engineering approaches 
have been reported: chimeric antibodies and complementarity<letermining-region (CDR) grafting. The simplest 
approach, chimeric antibodies, takes advantage of the fact that the variable and constant portions of an antibody mol- 
ecule are encoded on separate axons. By simply fusing the variable region exons of a rearranged mouse antibody gene 
with a human constant region exons, a hybrid antibody gene can be obtained (Morrison, S.L., et al. (1984), Prpc. Natf, 

30 Acad. Sci. USA. 81, 6851 -6855). The major problem with this approach is that while the highly immunogenic mouse Fc 
region is eliminated, the remaining mouse Fab sequences are still immunogenic (Bruggemann, et al. (1989), iL&SL 
Med.. 170. 2153-2157). The CDR grafting approach uses computer modeling to generate a completely artificial anti- 
body in which the only mouse sequences are those involved in antigen binding (Riechmann, L, et al. (1988) , Nafem 
232. 323-327). Each of these approaches requires the prior characterization of a rodent monoclonal antibody directed 

35 against the antigen of interest, and both require the generation of a stable transfected ceil line that produces high levels 
of the engineered antibody. 

Another approach to the production of human antibodies is a proposal involving the construction of bacterial 
expression libraries containing immunoglobulin cDNA sequences (Orlandi, et al. (1989). Prop. Natl. Acad. SP- USA, 86, 
3833-3837, and Huse, et al. (1989), Science. 246. 1275-1281). This technique reportedly has only been used to gen- 
40 erate antibody fragments derived from mouse cDNA sequences. 

A number of experiments have reported the use of transfected cell lines to determine the specific DN A sequences 
required for Ig gene rearrangement (reviewed by Lewis and Gellert (1989). £s!L 52. 585-588). Such reports have iden- 
tified putative sequences and concluded that the accessibility of these sequences to the recombinase enzymes used 
for rearrangement is modulated by transcription (Yancopoulos and Alt (1985). Cell, 40. 271 -281). The sequences for 
45 V(D)J joining are reportedly a highly conserved, near-palindromic heptamer and a less well conserved AT-rich nanomer 
separated by a spacer of either 12 or 23 bp (Tonegawa (1983). Nature, 202* 575-581 ; Hesse, et al. (1989), £asnfi£_in 
Dev.. a, 1053-1061). Efficient recombination reportedly occurs only between sites containing recombination signal 
sequences with different length spacer regions. 

The production of transgenic mice containing various forms of immunoglobulin genes has also been reported. 
so Rearranged mouse immunoglobulin heavy or light chain genes have been used to produce transgenic mice. Such 
transgenes reportedly are capable of excluding the rearrangement of endogenous Ig genes. See e.g. Weaver et al. 
(1985), CfilL 42. 1 1M27; Iglesias, et al. (1987), NatUTA 220, 482-484; Storb et al. (1985), Banbury Reports, 20. 197- 
207; Neuberger et al. (1989), Nature. 338. 350-352; Hagman et al. (1989), J. Exp t Med., !£& 1911-1929; and Storb 
(1989) in Immunoglobulin Genes, Academic Press, T. Honjo, F.W. Alt and T.H. Rabbitts eds. pp. 303-326. In addition, 
55 functionally rearranged human Ig genes including the u or y 1 constant region have been expressed in transgenic mice. 
Yamamura, et al. (1986), Proc Natl Acad. Sci. USA. S2. 2152-2156; Nussenzweig, et al. (1987), SSSDSS. 226. 816- 
819. In the case of the rearranged heavy chain gene, allelic exclusion of endogenous immunoglobulin gene loci was 
reported. 

Allelic exclusion, however, does not always occur in all transgenic B-cells. See e.g. Rath, et al. (1 989), J, Immunol . 
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143, 2074-2080 (rearranged u gene construct); Manz, et al. (1988). J. Exp. Med .. 168. 1363-1381 &i transgenes lacking 
transmembrane exons did not prevent rearrangement of the endogenous genes); Ritchie, et al. (1984), Nature. 312. 
51 7-520 and Storb. et al. (1986). Immunol, Rev,, 22,85-102 (transgenic mice expressing rearranged k transgene capa- 
ble of forming stable heavy/light chain complex only rearrange endogenous k genes in B-cells that fail to correctly rear- 
range endogenous heavy chain gene); and Manz, et al. (1988), J. Exp. Med.. 168. 1363-1381 (transgenic mice 
containing k gene encoding light chain incapable of combining with heavy chains, show only a low level of allelic exclu- 
sion). See also Nussenzweig, et al. (1 988), Nature 336, 446-450); Durdik, et al. (1989). Proc. Nati. Acad. Sri. USA. 86. 
2346-2350; and Shirrdzu, et al. (1989), Proc. Natl.- Acad Sci USA gg, 8020-8023. 

Somatic mutation has also been reported in a 15 kb mouse k gene construct in hyperimmunized transgenic mice 
(O'Brien, et al. (1987), Nature 326. 405-409; Storb (1989) in Immunoglobulin Genes, Academic Press. T Honjo. F.W. 
Aft, and T.H. Rabbrtts, eds. pp. 303-326) and in the variable portion of a u heavy chain transgene (Durdik, et al (1989) 
Proc. Natl. Acad. Sci. USA, 86. 2346-2350). 

Ig gene rearrangement, though studied in tissue culture cells, has not been extensively examined in transgenic 
mice. Only a handful of reports have been published describing rearrangement test constructs introduced into mice 
[Buchini, et al. (1987), Nature. 326. 409-41 1 (prearranged chicken X transgene); Goodhart et al. (1987) . Proc. Natl. 
Acad, Sci, USA, 64 4229-4233) (unrearranged rabbit k gene); and Bruggemann, et al. (1989). Proc. Nati Acad Sri 
JJSA 66. 6709-6713 (hybrid mouse-human heavy chain)]. The results of such experiments, however, have been varia- 
ble, in some cases, producing incomplete or minimal rearrangement of the transgene. 

Based on the foregoing, it is clear that a need exists for heterologous monoclonal antibodies, e.g. antibodies of 
human origin, derived from a species other than human. Thus, it is an object of the invention herein to provide a source 
of monoclonal antibodies that may be used therapeutically in the particular species for which they are designed. 

In accordance with the foregoing object transgenic nonhuman animals are provided which are capable of producing 
a heterologous antibody, such as a human antibody. 

Further, it is an object to provide B-cells from such transgenic animals which are capable of expressing heterolo- 
gous antibodies wherein such B-cells are immortalized to provide a source of a monoclonal antibody specific for a par- 
ticular antigen. 

In accordance with this foregoing object, it is a further object of the invention to provide hybridoma ceils that are 
capable of producing such heterologous monoclonal antibodies. 

Still further, it is an object herein to provide heterologous unrearranged and rearranged immunoglobulin heavy and 
light chain transgenes useful for producing the aforementioned non-human transgenic animals. 

Still further, it is an object herein to provide methods to disrupt endogenous immunoglobulin loci in the transgenic 
animals. 

Still further, it is an object-herein to provide methods to induce heterologous antibody production in the aforemen- 
tioned transgenic non-human animal. 

A further object of the invention is to provide methods to generate an immunoglobulin variable region gene segment 
repertoire that is used to construct one or more transgenes of the invention. 

The references discussed herein are provided solely for their disclosure prior to the filing date of the present appli- 
cation. Nothing herein is to be construed as an admission that the inventors are not entitled to antedate such disclosure 
by virtue of prior invention. 

SUMMARY Of T HE INVENTION 

In accordance with the foregoing objects, in one aspect of the invention, transgenic non-human animals are pro- 
vided that contain rearranged, unrearranged or a combination of rearranged and unrearranged heterologous immu- 
noglobulin heavy and light chain transgenes in the germline of the transgenic animal. For each of the foregoing animals, 
functionally rearranged heterologous heavy and light chain immunoglobulin transgenes are found in the B<ells of the 
transgenic animal. 

Heterologous heavy and/or light unrearranged immunoglobulin transgenes are introduced into a host non-human 
animal to produce a transgenic non-human animal containing a heavy and a light heterologous immunoglobulin gene 
or an intermediate animal containing one or the other transgene. When incorporated into the germline of such interme- 
diate animals, crosses between one containing a heavy chain transgene and one containing a light chain transgene 
produces a transgenic non-human animal containing both heavy and light heterologous imrrunoglobulin transgenes. 

The transgenes of the invention indude a heavy chain transgene comprising DNA encoding at least one variable 
gene segment one diversity gene segment, one joining gene segment and one constant region gene segment The 
immunoglobulin light chain transgene comprises DNA encoding at least one variable gene segment one joining gene 
segment and one constant region gene segment. The gene segments encoding the light and heavy chain gene seg- 
ments are heterologous to the transgenic non-human animal in that they are derived from, or correspond to, DNA 
encoding immunoglobulin heavy and light chain gene segments from a species not consisting of the transgenic non- 
human animal. In one aspect of the invention, the transgene is constructed such that the individual gene segments are 
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prearranged, i.e., not rearranged so as to encode a functional immunoglobulin light or heavy chain. Such unrear- 
ranged transgenes permit recombination of the gene segments (functional rearrangement) and somatic mutation of the 
resultant rearranged immunoglobulin heavy and/or light chains within the transgenic non-human animal when exposed 
to antigen. 

5 In one aspect of the invention, heterologous heavy and light immunoglobulin transgenes comprise relatively large 
fragments of unrearranged heterologous DNA. Such fragments typically comprise a substantial portion of the C. J (and 
in the case of heavy chain. D) segments from a heterologous immunoglobulin locus. In addition, such fragments also 
comprise a substantial portion of the variable gene segments. ^ 
In an alternate embodiment. HP LaserJet Series IIHPLASEII.PRSegments. In such transgene constructs, the var- 
10 ious regulatory sequences. e.g. promoters, enhancers, class switch regions, recombination signals and the like, com- 
prise corresponding sequences derived from the heterologous DNA. Alternatively, such regulatory sequences may be 
incorporated into the transgene from the same or a related species of the non-human animal used in the inventor For 
example, human immunoglobulin gene segments may be combined in a transgene with a rodent immunoglobulin 
enhancer sequence for use in a transgenic mouse, 
is In a method of the invention, a transgenic non-human animal containing germline unrearranged light and heavy 
immunoglobulin transgenes - that undergo VDJ joining during D-cell differentiation - is contacted with an antigen to 
induce production of a heterologous antibody in a secondary repertoire B-cell. Such induction causes somatic mutation 
in the rearranged heavy and/or light chain transgenes contained in primary repertoire B-cells to produce a heterologous 
antibody having high affinity and specificity for the antigen. 

Such antibody producing B-cells may be immortalized by transforming with a virus, or with an oncogene containing 
DNA construct, or alternatively, immortalized by fusing with a myeloma cell line to form antibody secreting hybndomas. 
In each instance, clones having sufficient affinity and specificity for a particular antigen are selected to provide a source 
of monoclonal antibody having low immunogenicity in the species from which the immunoglobulin gene segments of the 

transgenes are derived. • ■ u. 

Also included in the invention are vectors and methods to disrupt the endogenous immunoglobulin loci in the non- 
human animal to be used in the invention. Such vectors and methods utilize a transgene. preferably positive-negative 
selection vector, which is constructed such that it targets the functional disruption of a class of gene segments encoding 
a heavy and/or light immunoglobulin chain endogenous to the non-human animal used in the invention. Such endog- 
enous gene segments include diversity, joining and constant region gene segments. In this aspect of the invention, the 
positive-negative selection vector is contacted with at least one embryonic stem cell of a non-human animal after which 
cells are selected wherein the positive-negative selection vector has integrated into the genome of the non-human ani- 
mal by way of homologous recombination. After transplantation, the resultant transgenic non-human animal is substan- 
tially incapable of mounting an immunoglobulin-mediated immune response as a result of homologous integration of the 
vector. Such immune def icient non-human animals may thereafter be used for study of immune deficiencies or used as 
35 the recipient of heterologous immunoglobulin heavy and light chain transgenes. 

The invention also includes methods for generating a synthetic variable region gene segment repertoire to be used 
in the transgenes of the invention. The method comprises generating a population of immunoglobulin V segment DNAs 
wherein each of the V segment DNAs encodes an immunoglobulin V segment and contains at each end a cleavage rec- 
ognition site of a restriction endonuclease. The population of immunoglobulin V segment DNAs is thereafter concate- 
40 nated to form the synthetic immunoglobulin V segment repertoire. 

Another aspect of the invention includes transgenic nonhuman animals that contain functionally rearranged heter- 
ologous heavy and light chain immunoglobulin transgenes in the germline of the transgenic animal. Such animals con- 
tain primary repertoire B-cells that express such rearranged heavy and light transgenes. Such B-cells are capable 
undergoing somatic mutation when contacted with an antigen to form a heterologous antibody having high affinity and 

45 specificity for the antigen. 

The invention also includes transgenic animals containing germ line cells having a heavy and light transgene 
wherein one of the said transgenes contains rearranged gene segments with the other containing unrearranged gene 
segments. In the preferred embodiments, the rearranged transgene is a light chain immunoglobulin transgene and the 
unrearranged transgene is a heavy chain immunoglobulin transgene. 

so The invention also includes methods for producing heterologous antibodies in a transgenic animal containing pri- 
mary repertoire B-cells having rearranged heavy and light heterologous immunoglobulin transgenes. Such transgenic 
animals may be obtained from any of the aforementioned transgenic animals. Thus, the transgenic animal containing 
unrearranged heavy and light transgenes, the transgenic animal containing rearranged heavy and light transgenes or 
the animal containing one rearranged and one unrearranged transgene in the germline of the animal, each contain pn- 

55 mary repertoire B-cells having rearranged, heterologous heavy and light immunoglobulin transgenes. In the method of 
the invention, a desired first heterologous antibody is produced which is capable of binding a first antigen. The rear- 
ranged immunoglobulin heavy and light transgenes in the primary repertoire B-cells of such animals are known to pro- 
duce primary repertoire antibodies having sufficient affinity for a second known antigen. In this method, the transgenic 
non-human animal is contacted, sequentially or simultaneously, with the first and second antigen to induce production 
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of the first heterologous antibody by somatic mutation of the rearranged transgenes. The secondary repertoire B^eils 
so produced are then manipulated as previously described to immortalize the production of the desired monoclonal 
antibody capable of binding the f irst antigen. 

The present invention also includes plasmids. useful in cloning large DNA fragments (e.g., immunoglobulin 
genomic fragments), that have an origin of replication (ORQ, a copy control region (e.g., ROP, or the copy control region 
of pACYC177, or others known to those skilled in the art), and a cloning site. The plasmids also include a transcription 
terminator (e.g., trpR or others known to those skilled in the art) downstream of endogenous plasmid-derived promoters 
such as that of the ampictllin resistance gene feme"). The transcription termination is located upstream of the cloning 
site so that transcripts originating at the promoter are terminated upstream of the cloning sHe. In a preferred embodi- 
ment, the cloning site is flanked by rare restriction sites, which are sites consisting of seven, eight or more nucleotides 
instead of the six or fewer nucleotides that make up more common restriction sites; e.g., Not I, Sfi I, and Pac I Rare 
restriction sites also include sites that contain nucleotide sequences occurring rarely in natural DNA sequences- i e. 
less frequently than abort once in every 8,000-1 0.000 nucleotides. 

BRIEF DESCRIPTION OF THE FIGURES 

Rg. 1 depicts the complementarity determining regions CDR1 . CDR2 and CDR3 and framework regions FR1 , FR2, 
FR3 ami FR4 in unrearranged genomic DNA and mRNA expressed from a rearranged immunoglobulin heavy 
chain gene, 

Rg. 2 depicts the human X chain locus, 
Rg. 3 depicts the human k chain locus, 
Rg. 4 depicts the human heavy chain locus, 

Rgs. 5 and 6 depict the strategy for generating a synthetic V segment repertoire. 

Fig. 7 depicts the strategy for functional disruption of endogenous immunoglobulin loci. 

Rg. 8 depicts the Tcell mediated secondary response leading to maturation of the B-cell. 

Rg. 9 depicts somatic mutation and clonal expansion of B-cells in response to two different antigens. 

Fig. 10 depicts a transgene construct containing a rearranged IgM gene ligated to a 25 kb fragment that contains 

human y3 and y 1 constant regions followed by a 700 bp fragment containing the rat chain 3' enhancer sequence. 

Fig. 1 1 is a restriction map of the human k chain locus depicting the fragments to be used to form a light chain 

transgene by way of m yjyo homologous recombination. 

Fig. 12 depicts the construction of pGP1 . 

Fig. 13 depicts the construction of the polylinker contained in pGP1. 

Fig. 14 depicts the fragments used to construct a human heavy chain transgene of the invention. 
Fig. 1 5 depicts the construction of pHIGl and pCON1 . 

Rg. 16 depicts the human Cy1 fragments which are inserted into pRE3 (rat enhancer 3*) to form pREG2 
Fig. 1 7 depicts the construction of pHIG3' and PCON. 

Rg. 18 depicts the fragment containing human D region segments used in construction of the transgenes of the 
invention. 

Fig. 19 depicts the construction of pHIG2 (D segment containing plasmid). 

Rg. 20 depicts the fragments covering the human Jk and human Ck gene segments used in constructing a trans- 
gene of the invention. 
Fig. 21 depicts the structure of pEji. 
Fig. 22 depicts the construction of pKapH. 

Figs. 23A through 23D depict the construction of a positive-negative selection vector for functionally disrupting the 
endogenous heavy chain immunoglobulin locus of mouse. 

Figs. 24A through 24C depict the construction of a positive-negative selection vector for functionally disrupting the 

endogenous immunoglobulin light chain loci in mouse. 

Figs. 25 a through e depict the structure of a kappa light chain targeting vector. 

Figs. 26 a through f depict the structure of a mouse heavy chain targeting vector. 

Rg. 27 depicts the map of vector pGPe. 

Rg. 28 depicts the structure of vector pJM2. 

Fig. 29 depicts the structure of vector pCOR1 . 

Rg. 30 depicts the transgene constructs for plGM1 , pHCl and pHC2. 

Rg. 31 depicts the structure of pye2. 

Fig. 32 depicts the structure of pVGE1. 

Rg. 33 depicts the assay results of human Ig expression in a pHCl transgenic mouse. 
Rg. 34 depicts the structure of pJCK1 . 

Rg. 35 depicts the construction of a synthetic heavy chain variable region. 
Table 1 depicts the sequence of vector pGPe. 
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Table 2 depicts the sequence of gene Vh49.8. 

nFTAII Fn DESCRIPTION 

The design of a transgenic non-human animal that responds to foreign antigen stimulation with a heterologous anti- 
body repertoire requires that the heterologous immunoglobulin transgenes contained within the transgenic animal 
function correctly throughout the pathway of B-cell development: Accordingly, the transgenes in one aspect of the inven- 
tion are constructed so as to produce one or all of the following: (1) high level and cell-type specific expression, (2) func- 
tional gene rearrangement, (3) activation of and response to allelic exclusion. (4) expression of a sufficient primary 
repertoire. (5) signal transduction. (6) class switching. (7) somatic hypermutation. and (8) domination of the transgene 
antibody locus during the immune response. . 

As will be apparent from the following disclosure, not all of the foregoing criteria need be met. For example, in those 
embodiments wherein the endogenous immunoglobulin loci of the transgenic animal are functionally disrupted, the 
transgene need not activate allelic exclusion. Further, in those embodiments wherein the transgene comprises a func- 
tionally rearranged heavy and/or light chain immunoglobulin gene, the second criteria of functional gene rearrangement 
is unnecessary, at least for that transgene which is already rearranged. For background on molecular immunology, sefi. 
Fundamental immunology. 2nd edition (1989). Paul William E., ed. Raven Press. N.Y. 

The Structure and Generatio n of Antibodies 

Immunoglobulins, also known as antibodies, are a group of glycoproteins present in the serum and tissue fluids of 
all mammals They are produced in large amounts by plasma cells (also referred to herein as "secondary repertoire B- 
cells") which develop from precursor B lymphocytes (referred to herein as "primary repertoire B-cells"). Such pnmary 
repertoire B-cells carry membrane-bound immunoglobulin which is similar to that produced by the fully differentiated 
secondary repertoire B-cell. Contact between primary repertoire B-cells and foreign antigen is required tor the induction 
of antibody formation. . . 

The basic structure of all immunoglobulins is based upon a unit consisting of two identical light polypeptide chains 
and two identical heavy polypeptide chains linked together by disulfide bonds. Each light chain comprises two regions 
known as the variable light chain region and the constant light chain region. Similarly, the immunoglobulin heavy chain 
comprises two regions designated the variable heavy chain region and the constant heavy chain region. The constant 
region tor the heavy or light chain is encoded by genomic sequences referred to as heavy or light constant region gene 
segments The use of a particular heavy chain gene segment defines the class of immunoglobulin. For example, in 
humans the n constant region gene segments define the IgM class of antibody whereas the use of a y. y2. y3 or 74 con- 
stant region gene segment defines the IgG class of antibodies as well as the IgG subclasses lgG1 through lgG4. 

The variable regions of the heavy and light immunoglobulin chains together contain the antigen binding domain of 
the antibody. Because of the need for diversity in this region of the antibody to permit binding to a wide range of anti- 
gens the DNA encoding the initial or primary repertoire variable region comprises a number of different DNA segments 
derived from families of specific variable region gene segments. In the case of the light chain variable region, such fam- 
ilies comprise variable (V) gene segments and joining (J) gene segments. Thus, the initial variable region of the light 
chain is encoded by one V gene segment and one J gene segment each selected from the family of V and J gene seg- 
ments contained in the genome DNA of the organism. In the case of the heavy chain variable region, the DNA encoding 
the initial or primary repertoire variable region of the heavy chain comprises one heavy chain V gene segment, one 
heavy chain diversity (D) gene segment and one J gene segment, each selected from the appropriate V, D and J fam- 
ilies of immunoglobulin gene segments in genomic DNA. 

The Primary R epertoire 

The process for generating DNA encoding the heavy and light chain immunoglobulin genes occurs primarily in 
developing B-cells. Prior to the joining of various immunoglobulin gene segments, the V, D. J and constant (C) gene 
segments are found, for the most part, in clusters of V. D. J and C gene segments in the precursors of primary repertoire 
B-cells. Generally, all of the gene segments for a heavy or light chain are located in relatively close proximity on a single 
chromosome. Such genomic DNA prior to recombination of the various immunoglobulin gene segments is referred to 
herein as "unrearranged" genomic DNA. During B-cell differentiation, one of each of the appropriate family members of 
the V. D J (or only V and J in the case of light chain genes) gene segments are recombined to form functionally rear- 
ranged heavy and light immunoglobulin genes. Such functional rearrangement is of the variable region segments to 
form DNA encoding a functional variable region. This gene segment rearrangement process appears to be sequerrtal. 
First heavy chain D-to-J joints are made, followed by heavy chain V-to-DJ joints and light chain V-to-J joints. The DNA 
encoding this initial form of a functional variable region in a light and/or heavy chain is referred to as "functionally rear- 
ranged DNA" or "rearranged DNA". In the case of the heavy chain, such DNA is referred to as "rearranged heavy chain 
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DNA" and in the case of the fight chain, such DNA is referred to as "rearranged light chain DNA". Similar language is 
used to describe the functional rearrangement of the transgenes of the invention. 

The recombination of variable region gene segments to form functional heavy and light chain variable regions is 
mediated by recombination signal sequences (RSS's) that flank recombinationalty competent V, D and J segments. 

5 RSS's necessary and sufficient to direct recombination, comprise a dyad-symmetric heptamer, an AT-rich nonamer and 
an intervening spacer region of either 12 or 23 base pairs. These signals are conserved among the different loci and 
species that carry out D-J (or V-J) recombination and are functionally interchangeable. See Oettinger, et a). (1990), Sci- 
ence. 248, 151 7-1523 and references cited therein. The heptamer comprises the sequence CACAGTG or its analogue 
followed by a spacer of unconsented sequence and then a nonamer having the sequence ACAAAAACC or its ana- 

10 logue. These sequences are found on the J, or downstream side, of each V and D gene segment Immediately preced- 
ing the germline D and J segments are again two recombination signal sequences, first the nonamer and then the 
heptamer again separated by an unconsented sequence. The heptameric and nonameric sequences following a V L , V H 
or D segment are complementary to those preceding the J L , Dor J H segments with which they recombine. The spacers 
between the heptameric and nonameric sequences are either 12 base pairs long or between 22 and 24 base pairs long. 

75 In addition to the rearrangement of V, D and J segments, further diversity is generated in the primary repertoire of 
immunoglobulin heavy and light chain by way of variable recombination between the V and J segments in the light chain 
and between the D and J segments of the heavy chain. Such variable recombination is generated by variation in the 
exact place at which such segments are joined. Such variation in the light chain typically occurs within the last codon 
of the V gene segment and the first codon of the J segment. Similar imprecision in joining occurs on the heavy chain 

20 chromosome between the D and J H segments and may extend over as many as 10 nucleotides. Furthermore, several 
nucleotides may be inserted between the D and J H and between the V H and D gene segments which are not encoded 
by genomic DNA. The addition of these nucleotides is known as N-region diversity. 

After VJ and/or VDJ rearrangement, transcription of the rearranged variable region and one or more constant 
region gene segments located downstream from the rearranged variable region produces a primary RNA transcript 

25 which upon appropriate RNA splicing results in an mRNA which encodes a full length heavy or light immunoglobulin 
chain. Such heavy and light chains include a leader signal sequence to effect secretion through and/or insertion of the 
immunoglobulin into the transmembrane region of the B-cell. The DNA encoding this signal sequence is contained 
within the first exon of the V segment used to form the variable region of the heavy or light immunoglobulin chain. Appro- 
priate regulatory sequences are also present in the mRNA to control translation of the mRNA to produce the encoded 

30 heavy and light immunoglobulin polypeptides which upon proper association with each other form an antibody mole- 
cule. 

The net effect of such rearrangements in the variable region gene segments and the variable recombination which 
may occur during such joining, is the production of a primary antibody repertoire. Generally, each B-cell which has dif- 
ferentiated to this stage, produces a single primary repertoire antibody. During this differentiation process, cellular 
35 events occur which suppress the functional rearrangement of gene segments other than those contained within the 
functionally rearranged Ig gene. The process by which diploid B-cells maintain such mono-specificity is termed allelic 
exclusion. 

The Secondary Repertoire 

40 

B-ceil clones expressing immunoglobulins from within the set of sequences comprising the primary repertoire are 
immediately available to respond to foreign antigens. Because of the limited diversity generated by simple VJ and VDJ 
joining, the antibodies produced by the so-called primary response are of relatively low affinity. Two different types of B- 
cefls make up this initial response: precursors of primary antibody-forming cells and precursors of secondary repertoire 

45 B-cells (Linton, et al. (1989). QsSL 52, 1049-1059). The first type of B-cell matures into IgM-secreting plasma cells in 
response to certain antigens. The other B-cells respond to initial exposure to antigen by entering a T-cell dependent 
maturation pathway. It is during this T-cell dependent maturation of B-cells that a second level of diversity is generated 
by a process termed somatic mutation (sometimes also referred to as hypermutaiion). These primary repertoire B-cells 
use the immunoglobulin molecules on their surfaces to bind and internalize the foreign antigen. If the foreign antigen is 

so a protein or is physically finked to another protein antigen, that protein antigen is then processed and presented on the 
cell surface by a major histocompatibility complex (MHC) molecule to a helper T-cell which in turn induces maturation 
of the B-cell. Lanzavecchia (1985), Nature. 314. 537. This overall maturation of the B-cell is known as the secondary 
response. 

During the T-cell dependent maturation of antigen stimulated B-celi clones, the structure of the antibody molecule 
55 on the cell surface changes in two ways: the constant region switches to a non-IgM subtype and the sequence of the 
variable region is modified by multiple single amino acid substitutions to produce a higher affinity antibody molecule. It 
is this process of somatic mutation, followed by the selection of higher affinity clones, that generates highly specific and 
tightly binding immunoglobulins characterized by the Ig mediated immune response. 

As previously indicated, each variable region of a heavy or fight Ig chain contains an antigen binding domain. It has 
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been determined by amino acid and nucleic acid sequencing that somatic mutation during the secondary response 
occurs throughout the V region including the three complementary determining regions (CDR1 , CDR2 and CDR3) also 
referred to as hypervariable regions 1, 2 and 3. The CDR1 and CDR2 are located within the variable gene segment 
whereas the CDR3 is largely the result of recombination between V and J gene segments or V, D and J gene segments. 
Those portions of the variable region which do not consist of CDR1, 2 or 3 are commonly referred to as framework 
regions designated FR1 , FR2, FR3 and FR4. See Fig. 1 . During hypermutation, the rearranged DNA is mutated to give 
rise to new clones with altered Ig molecules. Those clones with higher affinities for the foreign antigen are selectively 
expanded by helper T-cells, giving rise to affinity maturation of the expressed antibody. Clonal selection typically results 
in expression of clones containing new mutation within the CDR1 , 2 and/or 3 regions. However, mutations outside these 
regions also occur which influence the specificity and affinity of the antigen binding domain. 

Transgenic Non-Human Animals Capable of Producing Heterologous Antibody 

Transgenic non-human animals in one aspect of the invention are produced by introducing at least one of the immu- 
noglobulin transgenes of the invention (discussed hereinafter) into a zygote or early embryo of a non-human animal. 
The non-human animals which are used in the invention generally comprise any mammal which is capable of rearrang- 
ing immunoglobulin gene segments to produce a primary antibody response and, which, in addition, are capable of 
mounting a secondary response by way of somatic mutation of such rearranged Ig genes. A particularly preferred non- 
human animal is the mouse or other members of the rodent family. Mice are particularly useful since their immune sys- 
tem has been extensively studied, including the genomic organization of the mouse heavy and light immunoglobulin 
loci. See e.g. Immunoglobulin Genes. Academic Press, T. Honjo, F.W. Alt and T.H. Rabbitts, eds. (1989). 

However, the invention is not limited to the use of mice. Rather, any non-human mammal which is capable of 
mounting a primary and secondary antibody response may be used. Such animals include non-human primates, such 
as chimpanzee, bovine, ovine and porcine species, other members of the rodent family, e.g. rat, as well as rabbit and 
guinea pig. Particular preferred animals are mouse, rat, rabbit and guinea pig, most preferably mouse. 

As used herein, the term "antibody" refers to a glycoprotein comprising at least two identical light polypeptide 
chains and two identical heavy polypeptide chains linked together by disulfide bonds. Each of the heavy and light 
polypeptide chains contains a variable region (generally the amino terminal portion of the polypeptide chain) which con- 
tains a binding domain which interacts with antigen. Each of the heavy and light polypeptide chains also comprises a 
constant region of the polypeptide chains (generally the carboxyl terminal portion) some of which sequences mediate 
the binding of the immunoglobulin to host tissues including various cells of the immune system, some phagocytic cells 
and the first component (C1q) of the classical complement system. 

As used herein, a "heterologous antibody" is defined in relation to the transgenic non-human organism producing 
such an antibody. It is defined as an antibody having an amino acid sequence or an encoding DNA sequence corre- 
sponding to that found in an organism not consisting of the transgenic non-human animal. Thus, prior to rearrangement 
of a transgene containing various heavy or light chain gene segments, such gene segments may be readily identified, 
e.g. by hybridization or DNA sequencing, as being from a species of organism other than the transgenic animal. For 
example, in one embodiment of the invention, various gene Segments from the human genome are used in heavy and 
light chain transgenes in an unrearranged form. In this embodiment, such transgenes are introduced into mice. TTie 
unrearranged gene segments of the light and/or heavy chain transgene have DNA sequences unique to the human 
species which are distinguishable from the endogenous immunoglobulin gene segments in the mouse genome. They 
may be readily detected in unrearranged form in the germ line and somatic cells not consisting of B-cells and in rear- 
ranged form in B-cells. 

In an alternate embodiment of the invention, the transgenes comprise rearranged heavy and/or light immunoglob- 
ulin transgenes. Specific segments of such transgenes corresponding to functionally rearranged VDJ or VJ segments, 
contain immunoglobulin DNA sequences which are also clearly distinguishable from the endogenous immunoglobulin 
gene segments in the mouse. 

Such differences in DNA sequence are also reflected in the amino acid sequence encoded by such human immu- 
noglobulin transgenes as compared to those encoded by mouse B-cells. Thus, human immunoglobulin amino acid 
sequences may be detected in the transgenic non-human animals of the invention with antibodies specific for immu- 
noglobulin epitopes encoded by human immunoglobulin gene segments. 

Transgenic B-cells containing unrearranged transgenes from human or other species functionally recombine the 
appropriate gene segments to form functionally rearranged light and heavy chain variable regions, ft is to be understood 
that the DNA of such rearranged transgenes for the most part will not correspond exactly to the DNA sequence of the 
gene segments from which such rearranged transgenes were obtained. This is due primarily to the variations intro- 
duced during variable recombination and because of mutations introduced by hypermutation during the secondary 
response. Notwithstanding such modifications in DNA (as well as in amino acid) sequence, it will be readily apparent 
that the antibody encoded by such rearranged transgenes has a DNA and/or amino acid sequence which is heterolo- 
gous to that normally encountered in the non-human animal used to practice the invention. 
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The term "substantia) identity", when referring to polypeptides, indicates that the polypeptide or protein in question 
exhibits at least about 30% identity with an entire naturally occurring protein or a portion thereof, usually at least about 
70% identity, and preferably at least about 95% identity As used herein, the terms "isolated", "substantially pure" and 
"substantially homogenous" are used interchangeably herein and describe a polypeptide protein which has been sep- 

5 arated from components which naturally accompany it Typically, a monomelic protein is substantially pure when at 
least about 60 to 75% of a sample exhibits a single polypeptide backbone Minor variants or chemical modifications typ- 
ically share the same polypeptide sequence. A substantially pure protein will typically comprise over about 85 to 90% 
of a protein sample, more usually about 95%, and preferably will be over about 99% pure. Protein purity or homogeneity 
may be indicated by a number of means well known in the art. such as polyacrylamide gel electrophoresis of a protein 

to sample, followed by visualizing a single polypeptide bard on a polyacrylamide gel upon staining. For certain purposes 
high resolution will be needed and HPLC or a similar means for purification utilized. A polypeptide is substantially free 
of naturally-associated components when it is separated from the native contaminants which accompany it in its natural 
state. Thus, a polypeptide which is synthesized in a cellular system different from the cell from which it naturally origi- 
nates will be substantially free from its naturally-associated components. 

Unrearranged Transgenes 

As used herein, an "unrearranged immunoglobulin heavy chain transgene" comprises DNA encoding at least one 
variable gene segment, one diversity gene segment, one joining gene segment and one constant region gene segment. 
20 Each of the gene segments of said heavy chain transgene are derived from, or has a sequence corresponding to, DNA 
encoding immunoglobulin heavy chain gene segments from a species not consisting of the non-human animal into 
which said transgene is introduced. Similarly, as used herein, an "unrearranged immunoglobulin light chain transgene" 
comprises DNA encoding at least one variable gene segment, one joining gene segment and at least one constant 
region gene segment wherein each gene segment of said light chain transgene is derived from, or has a sequence cor- 

6 responding to, DNA encoding immunoglobulin light chain gene segments from a species not consisting of the non- 
human animal into which said light chain transgene is introduced. 

Such heavy and light chain transgenes in this aspect of the invention contain the above-identified gene segments 
in an unrearranged form. Thus, interposed between the V, D and J segments in the heavy chain transgene and between 
the V and J segments on the light chain transgene are appropriate recombination signal sequences (RSS's). In addi- 

30 tion, such transgenes also include appropriate RN A splicing signals to join a constant region gene segment with the VJ 
or VDJ rearranged variable region. 

To the extent that the heavy chain transgene contains more than one C region gene segment e.g. Cn and Cyl from 
the human genome, as explained below "switch regions* are incorporated upstream from each of the constant region 
gene segments and downstream from the variable region gene segments to permit recombination between such con- 

35 stant regions to allow for immunoglobulin class switching, e.g. from IgM to IgG. Such heavy and light immunoglobulin 
transgenes also contain transcription control sequences including promoter regions situated upstream from the variable 
region gene segments which contain OCTA and TATA motifs. 

In addition to promoters, other regulatory sequences which function primarily in B-lineage cells are used. Thus, for 
example, a light chain enhancer sequence situated preferably between the J and constant region gene segments on 

40 the light chain transgene is used to enhance transgene expression, thereby facilitating allelic exclusion. In the case of 
the heavy chain transgene, regulatory enhancers and also employed. 

Although the foregoing promoter and enhancer regulatory control sequences have been generically described, 
such regulatory sequences may be heterologous to the nonhuman animal being derived from the genomic DNA from 
which the heterologous transgene immunoglobulin gene segments are obtained. Alternately, such regulatory gene seg- 

45 ments are derived from the corresponding regulatory sequences in the genome of the non-human animal, or closely 
related species, which contains the heavy and light transgene. Such regulatory sequences are used to maximize the 
transcription and translation of the transgene so as to induce allelic exclusion and to provide relatively high levels of 
transgene expression. 

In the preferred embodiments, each of the immunoglobulin gene segments contained on the heavy and light Ig 
so transgenes are derived from, or have sequences corresponding to, genomic DNA, cDNA or portions thereof from a spe- 
cies or individual which is heterologous to the non-human animal into which the transgene is to be introduced. As a con- 
sequence, when such gene segments are functionally rearranged and hypermutated in the transgenic non-human 
animal, the heterologous antibody encoded by such heavy and light transgenes will have sin amino acid sequence and 
overall secondary and terteriary structure which provides specific utility against a desired antigen when used therapeu- 
55 tically in the organism from which the Ig gene segments are derived. In addition, such antibodies demonstrate substan- 
tially reduced immunogenicity as compared to antibodies which are "foreign" to the organism treated. 

For example, in the preferred embodiments, gene segments are derived from human beings. The transgenic non- 
human animals harboring such heavy and light transgenes are capable of mounting an Ig-mediated immune response 
to a specific antigen administered to such an animal. B-cells are produced within such an animal which are capable of 
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producing heterologous human antibody. After immortalization, and the selection for an appropriate monoclonal anti- 
body (Mab), e.g. a hybridoma, a source of therapeutic human monoclonal antibody is provided. Such human Mabs 
have significantly reduced immunogenicity when therapeutically administered to humans. 

Examples of antigens which may be used to generate heterologous antibodies in the transgenic animals of the 

5 invention containing human immunoglobulin transgenes include bacterial, viral and tumor antigen as well as particular 
human B- and T-cell antigens associated with graft rejection or autoimmunity. 

Although the preferred embodiments disclose the construction of heavy and light transgenes containing human 
gene segments, the invention is not so limited. In this regard, it is to be understood that the teachings described herein 
may be readily adapted to utilize immunoglobulin gene segments from a species other than human beings. For exam- 

10 pie, in addition to the therapeutic treatment of humans with the antibodies of the invention, therapeutic antibodies 
encoded by appropriate gene segments may be utilized to generate monoclonal antibodies for use in the veterinary sci- 
ences. For example, the treatment of livestock and domestic animals with species-related monoclonal antibodies is also 
contemplated by the invention. Such antibodies may be similarly generated by using transgenes containing immu- 
noglobulin gene segments from species such as bovine, ovine, porcine, equine, canine, feline and the like. 

15 

Class Switching 

The use of or S constant regions is largely determined by alternate splicing, permitting IgM and IgD to be coex- 
pressed in a single cell. The other heavy chain isotypes (y, a, and s) are only expressed natively after a gene rearrange- 

20 ment event deletes the Cji and CS exons. This gene rearrangement process, termed class switching, occurs by 
recombination between so called switch segments located immediately upstream of each heavy chain gene (except 6). 
The individual switch segments are between 2 and 10 kb in length, and consist primarily of short repeated sequences. 
The exact point of recombination differs for individual class switching events. 

The ability of a transgene construction to switch isotypes during B-cell maturation has not been directly tested in 

25 transgenic mice; however, transgenes should carry out this function, Durdik et al. (1 989) Proc. Natl. Acad. Sci. USA. 86. 
2346-2350) microinjected a rearranged mouse p. heavy chain gene construct and found that in four independent mouse 
lines, a high proportion of the transgenic B-cells expressed the transgene-encoded variable region associated with IgG 
rather than IgM. Thus, isotype switching appears to have taken place between the transgene and the endogenous y 
constant region on another chromosome. 

30 As used herein, the term switch sequence thus refers to those DNA sequences responsible for switch recombina- 
tion. A "switch donor" sequence, typically a \i switch region, will be 5' (i.e., upstream) of the construct region to be 
deleted during the switch recombination. The "switch acceptor" region will be between the construct region to be 
deleted and the replacement constant region (e.g., y, e, etc.). As there is no specific site where recombination always 
occurs, the final gene sequence will typically not be predictable from the construct. 

35 The switch (S) region of the ji gene, S M , is located about 1 to 2 kb 5' to the coding sequence and is composed of 
numerous tandem repeats of sequences of the form (GAGCT) n (GGGGT), where n is usually 2 to 5 but can range as 
high as 17. (See T Nikaido. et al. (1981): Nature. 222:845-848.) 

Similar internally repetitive switch sequences spanning several kilobases have been found 5' of the other C H genes. 
The Sa region has been sequenced and found to consist of tandemly repeated 80-bp homology units, whereas Sy2a. 

40 Sy2b. and $yz all contain repeated 49-bp homology units very similar to each other. (See, P. Szurek, et al. (1985): JL 
Immunol. 135:620-626 and T. Nikaido, et al. (1982): J. Biol. Chem.. 257: 7322-7329.) All the sequenced S regions 
include numerous occurrences of the pentamers GAGCT and GGGGT that are the basic repeated elements of the 
gene(T. Nikaido, et al. (1982): J. Biol. Chem.. 257:7322-7329); in the other S regions these pentamers are not precisely 
tandemly repeated as in S^, but instead are embedded in larger repeat units. 

45 The S r1 region has an additional higher-order structure: two direct repeat sequences flank each of two clusters of 
49-bp tandem repeats. (See M. R. Mowatt, et al (1986): J. Immunol.. 136:2674-2683). Switch regions of human H chain 
genes have been found very similar to their mouse homologs. Generally, unlike the enzymatic machinery of V-J recom- 
bination, the switch machinery can apparently accommodate different alignments of the repeated homologous regions 
of germline S precursors and then join the sequences at different positions within the alignment. (See. T. H. Rabbits, et 

so al (1981): Nucleic Acids Res.. 2:4509-4524 and J. Ravetch, et al. (1980): Proc. Natl. Acad. Sci. USA. ZZ:6734-6738.) 
Induction of class switching appears to be associated with sterile transcripts that initiate upstream of the switch 
segments (Lutzker et al.. 1988 Mol. Cell. Biol., & 1849; Stavnezer et al. 1988 Proc. Natl. Acad. Sci. USA. 85, 7704; 
Esser and Radbruch 1989 EMBO J ., ft- 483; Berton et al. 1989 Proc. Natl. Acad. Sci. USA. S£, 2829; Rothman et al. 
1 990 Int. Immunol . 2, 621). For example, the observed induction of the y1 sterile transcript by IL-4 and inhibition by IFN- 

55 y correlates with the observation that IL-4 promotes class switching to y1 in B-cells in culture, while IFN-y inhibits y1 
expression. Ideally then, transgene constructs that are intended to undergo class switching should include all of the cis- 
acting sequences necessary to regulate these sterile transcripts. An alternative method for obtaining class switching in 
transgenic mice (op. and eh) involves the inclusion of the 400 bp direct repeat sequences that flank the human \i gene 
ffesui et al. 1989 Eur. J. Immunol.. 19. 1399). Homologous recombination between these two sequences deletes the ji 
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gene in IgD-only B-cells. 
Monoclona l Antibodies 

Monoclonal antibodies can be obtained by various techniques familiar to those skilled in the art. Brief ly. spleen cells 
from an animal immunized with a desired antigen are immortalized, commonly by fusion with a myeloma cell (see, Koh- 
lerand Milstein. Eur J. Immunol , ftS1 1 -519 (1976)). Alternative methods of immortalization include transformation with 
Epstem Barr Virus, oncogenes, or retroviruses, or other methods well known in the art Colonies arising from single 
.nTirortateed celte are screened for production of antibodies of the desired specificity and affinity for the antigen, and 
2 * J™ 0 ? 0 " 3 ' artbodies P^ced by such cells may be enhanced by various techniques, including injection 
into the peritoneal cavrty of a vertebrate host Various techniques useful in these arts are discussed, for example in 
Harlow and Lane. Antibodies; A Laboratory Manual Cold Spring Harbor. New Vbrk (1988) including: immunization of 
amrnals to produce immunoglobulins; production or monoclonal antibodies: labeling immunoglobulins for use as 
probes; immunoaffinity purification; and immunoassays. 

The Transgenic Primary Ropgrtoirft 

A. The Human Immunr^h,,^ | ^ 

An important requirement for transgene function is the generation of a primary antibody repertoire that is diverse 
enoughto tngger a secondary immune response for a wide range of antigens. The size of the human immunoglobulin 
toe. encoding the various gene segments for heavy and light chains is quite large. For example, in the human genome 

« , X m chain l0CUSl *• K li8ht locus and *• heavy chain locus together occupy over 

5 Mb of DMA or almost 0.2% of the entire genome. Each locus consists of multiple variable segments that recombine 
dunng B-cell development with a joining region segment (and. the heavy chain locus with diversity region segments) to 
form complete V region exons. Such rearranged light chain genes consist of three exons: a signal peptide exon a var- 
iable region exon and a constant region axon. The rearranged heavy chain gene is somewhat more complex. H consists 
of a signal peptide exon. a variable region exon and a tandem array of multi-domain constant region regions, each of 
which is encoded by several exons. Each of the constant region genes encode the constant portion of a different class 
of immunoglobulins. During B-cell development. V region proximal constant regions are deleted leading to the expres- 
sion of new heavy chain classes. For each heavy chain class, alternative patterns of RNA, splicing give rise to both 
transmembrane and secreted immunoglobulins. 

Approximately 40% of human serum antibody molecules contain X light chains. The structure of this locus which 
maps to chromosome 22ris the least well characterized (Fig. 2). It consists of an unknown number of V segments 
upstream of a tandem array of six constant region genes, each of which is linked to a single J segment In addition two 
more constant region segments with associated J segments have been isolated, although their linkage with the rest of 
the X cluster has not been established, and it is not known if they are used. E. Seising, et al.. "Immunoglobulin Genes" 
Academic Press, T. Honjo, F.W. Alt and T.H. Rabbitts. eds. (1989). 

ocn ^ ^1 Chain l0CUS iS Spread 0Ut 0ver three clusters 0,1 chromosome 2 (Fig. 3). The first two clusters, covering 
850 and 250 kb respectively contain only variable region gene segments. The third cluster, covering about 1 Mb con- 
tains approximately 40 V gene segments upstream of a cluster of 5 J segments followed by a single constant region 
gene segment A total of 84 V gene segments have been identified, and approximately half of these are thought to be 
pseuoogenes (Zachau (1989) in Immunoglobulin Genes, Academic Press. T. Honjo, F.W. Alt, and T.H. Rabbitts eds do 
91-1 10). Approximately 25 kb downstream of the CK region there is a "k deleting element" (icde). The Kde sequence 
rewmbines with upstream sequences, causing the deletion of the k constant region in X light chain expressing B-cells 
This leads to isotonic exclusion in cells that successfully rearrange both k and X genes 

The human heavy chain locus is the largest and most diverse. It consists of approximately 200 V gene segments 
spanning 2 Mb. approximately 30 D gene segments spanning about 40 kb, six J segments clustered within a 3 kb span 
TxTtT^ ? 0iOn 9ene seamems spread out over approximately 300 kb. The entire locus spans approximately 
- • f? r pcrton * ,ong arm * chrom °so'ne 14 (Fig. 4). The heavy chain V segments can be grouped 
J? ^ 165 °" basis ^^uence similarity. There are approximately 60 members of the V H 1 family. 30 V H 2 seo- 
S f, H , ^T 6 * 5, 30 Vh4 Se9ments " V h5 segments, and one V H 6 segment Berman. J.E.. et al. (1 988) 
msZLL L 727-738. In the human heavy chain locus, the members of individual V families are intermingled, unlike the 
mouse locus where related V segments are clustered. The single member of the VH6 family is the most proximal of the 
V segments, mapping to within 90 kb of the constant region gene segments. Sato. T, et al. (1988), Biochem. Btannvs 
mSiPZ 1 " ^J^TL^ m fUnCti0naI D and J se9mente aPPear to lie in this 90 kb region (Siebenlist et al.' 
22 LSffL 281, 63 " 635: Matsuda " * aL (1988) ' BiBSLL Z 1047-1051; Buluwela, et al. (1988), EMBOJ. 7 
2003-2010; Ichhara. et al. (1988), Ef/MLL Z 4141-4150; Berman. et al. (1988). EMBSLL Z 727-738) 
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r nana Frflgmant Transaenes 
1 . Heavy Chain Transaene 



In a preferred embodiment, immunoglobulin heavy and light chain transgenes comprise unrearranged genomic 
DNA from humans. In the case of the heavy chain, a preferred transgene comprises a Notl fragment having a length 
between 670 to 830 to. The length of this fragment is ambiguous because the 3' restriction ate has not been accurately 
mapped. It is known, however, to reside between the al and V a gene segments (see Fig. 4). This fragmerrt contains 
members of all six of the known V H families, the D and J gene segments, as well as the u. 8. y3, y1 and a1 constant 
reaions Berman et al. (1988), EM BO J.. L 727-738. A transgenic mouse line containing this transgene > correctly 

regions to trigger a secondary response for most antigens. 

-> | jght Chain Transoene 

A genomic fragment containing all of the necessary gene segments and regulatory sequences from a human light 
chain locus may be similarly constructed. Such a construct is described in the Examples. 

C. Transoenes Generated In tracellular^ by In viun Recombination 

It is not necessary to isolate the all or part of the heavy chain locus on a single DNA fragment Thus, for example, 
the 670-830 to Notl fragment from the human immunoglobulin heavy chain locus may be formed iruflya m the non- 
human animal during transgenesis. Sue* injfflffl transgene construction is produced by introducing two or more i over- 
lapping DNA fragments into an embryonic nucleus of the non-human animal. The overlapping portions of the DNA ^frag- 
rnents have DNA sequences which are substantially homologous. Upon exposure to the recombinases contained within 
the embryonic nucleus, the overlapping DNA fragments homologously recombined in proper or.entat.on to form the 
670-830 to Notl heavy chain fragment. 

It is to be understood, however, that iimffitransgene construction can be used to form any number of immunoglob- 
ulin transgenes which because of their size are otherwise difficult or impossible, to make or manipulate by Present 
technology Thus, jnvjvo. transgene construction is useful to generate immunoglobulin transgenes which are arger than 
D^SeTwf^y be manipulated by YAC vectors (Murray and Szostak (1983). Najyre, 305 189-193). Such 
in vivo transgene construction may be used to introduce into a non-human animal substantially the entire immunoglob- 
uli^ioci from a species not consisting of the transgenic non-human animal. Thus, although severa groups have suc- 
cessfully constructed libraries containing 50-200 to of DNA fragments in YAC vectors (Burke et al. (1987). Sosnse, 
23JL 806-812; Traver. et al. (1989). Prop Nflti Acad. Sci, USA. 8S. 5898-5902) and used ^ ^ ne p ^^°" to 
produce YAC libraries ranging in size from 200 to approximately 1000 to (McCormick. et al. (188J Pff^AcaA 
Sci USA. 86, 9991-9995), multiple overlapping fragments covering substantially more than the 670-830 to Notl frag- 
mentofthehuman constant region immunoglobulin loci are expected to readily produce larger transgenes by the meth- 

°* f^aSfontoforming genomic immunoglobulin transgenes, injdyo. homologous recombination may also be utilized 
to form "mini-locus" transgenes as described in the Examples. 

In the preferred embodiments utilizing invjva transgene construction, each overlapping DNA/ragmerrt preferably 
has an overlapping substantially homologous DNA sequence between the end portion of one DNA fragment and toe 
end portion of a second DNA fragment. Such overlapping portions of the DNA fragments preferably compnse aboutSOO 
bp to about 2000 bp, most preferably 1 .0 to to 2.0 to. Homologous recombination of overlapping DNA fragments to 
form transgenes iota is further described in common.y assigned U.S. Patent Application ^ ™^ °£ 
eration ofDNAby Homologous Recombination of DNA Fragments" filed August 29. 1990. under U.S.S.N. 07/574,747. 

D. Minilocus Transoenes 

As used herein, the term "immunoglobulin minilocus" refers to a DNA sequence (which may be within a longer 
sequence), usually of less than about 150 to, typically between about 25 and 100 to, containing at least one each 
the following: a functional variable (V) gene segment a functional joining (J) region segment, a functional consterrt : (C) 
region gene segment, and-if it is a heavy chain minilocus-a functional diversity (D) region segment such , that said 
r DNA sequence contains at least one substantial discontinuity (e.g., a deletion, usually of at least about 2 to 5 to, pref- 
erably 10-25 to or more, relative to the homologous genomic DNA sequence). A light chain minilocus transgene w.11 be 
at least 25 to in length, typically 50 to 60 to. A heavy chain transgene will typically be about 70 to 80 to in lengtMref- 
erably at least about 60 to with two constant regions operably linked to switch regions, versus at least about 30 to with 
a single constant region and incomplete switch regions. Furthermore, the individual elements of the mmilocus are pref- 



12 



EP0814159 A2 

erably in the germline configuration and capable of undergoing gene rearrangement in the pre-B cell of a transgenic 

an '!Hf^ aS to 6XpreSS functional molecules with diverse antigen specificities encoded entirely by the ele- 

ments of the minilocus. 

'Iff? al !f!^ t . e f Jref ! rred embodiment, immunoglobulin heavy and light chain transgenes comprise one or more of 
5 each of the V. D. J and C gene segments. At least one of each appropriate type gene segment is incorporated into the 
minrtocus transgene. With regard to the C segments for the heavy chain transgene. it is preferred that the transgene 
contain at least one u gene segment and at least one other constant region gene segment more preferably a y gene 
segment and most preferably y3 or r 1. This preference is to allow for class switching between IgM and IgQ forms of the 
encoded immunoglobulin to provide for somatic mutation and the production of a secretaUe form of high affinity non- 
'9^ 'rnmun<>gJobulin. Other constant region gene segments may also be used such as those which encode for the pro- 
duction of IgD. IgA and IgE. ^' 

The heavy chain J region segments in the human comprise six functional J segments and three pseudo genes clus- 
tered in a 3 kb stretch of DNA. Given its relatively compact size and the ability to isolate these segments together with 
tfie u gene and the 5 portion of the 6 gene on a single 23 kb SFil/Spel fragment (Sado. et al. (1988). Biochem. Binshvs 
is Res, Comm.. 154, 264271). it is preferred that all of the J region gene segments be used in the ironies construct 
Since this fragment spans the region between the n and 6 genes, it is likely to contain all of the 3' cis-linked regulatory 
elements required fa \i expression. Furthermore, because this fragment includes the entire J region, it contains the 
heavy chain enhancer and the » switch region (Mills, et al. (1983). Nature. 306. 809; Yancopoutos and Alt (1986) Ann. 
Rev. Immunol. , i, 339-368). It also contains the transcription start sites which trigger VDJ joining to form primary rep- 
ertoire B-cells (Yancopoulos and Aft (1985). QsH SSL 271-281). Alternatively, a 36 kb BssHII/Speh fragment which 
includes part on the D region, may be used in place of the 23 kb Sfil/Spell fragment. The use of such a fragment 
increases the amount of 5' flanking sequence to facilitate efficient D-to-J joining. 

k. . ° reflion consiste 01 4 or 5 homol °8°"s 9 •* subregions, linked in tandem (Siebenlist. et al. (1981) 

NaiurjL 29A 631-635). Each subregion contains up to 10 individual D segments. Some of these segments have been 
mapped and are shown in Fig. 4. Two different strategies are used to generate a mini-locus D region. The first strategy 
involves using only those D segments located in a short contiguous stretch of DNA that includes one or two of the 
repeated D subregions. A candidate is a single 15 kb fragment that contains 12 individual D segments. This piece of 
, « f??^°l 2 Gon * UOUB EcoR| fragments and has been completely sequenced (Ichihara. et al. (1988). EMBCLL 
I <J41-4150). Twelve D segments should be sufficient for a primary repertoire. However, given the dspersedmJure" 
of the D region, an alternative strategy is to ligate together several non-contiguous D-segment containing fragments to 
produce a smaller piece of DNA with a greater number of segments. 

t™* preferab| y more one v 9ene segment is used to construct the heavy chain minilocus trans- 
? 6 ?!'^ , . p,6Ce A containin 9 one or *"» ^rearranged V segments together with flanking sequences is 
isolated. A clone containing such DNA is selected using a probe generated from unique 5'. sequences determined from 
to transcrfced V region of a characterized human hybridoma such as that which produces anti-cytomegalovirus anti- 
body (NewkirkfiU!. (1988) J, Clin, Invest . 6L 1511-1518). The S untranslated sequence of the heavy chain mRNAis 
used to construct a unique nucleotide probe (preferably about 40 nucleotides in length) for isolating the original germ- 
line V segment that generated this antibody. Using a V segment that is known to be incorporated in an antibody against 
a known antigen not only insures that this V segment is functional, but aids in the analyse of transgene participatJon in 
secondary immune responses. This V segment is fused with the minilocus D region and constant region fragments dis- 
cussed previously, to produce a mini-locus heavy chain transgene. 

AHernatively, a large, contiguous stretch of DNA containing multiple V region segments is isolated from a YAC 
library- Different sized pieces of DMA. containing different numbers of V region segments, are tested for their ability to 
provKJe a human antibody repertoire in the minilocus transgene construct It is also possible to build one large fragment 
from several non-contiguous V segment containing fragments using YAC vectors (Murray and Szostak (1983) Nature. 
305. 189-193), F factor-based plasrrrids (O'Conner. et al. (1989). Sctencs. 244, 1307-1312) or the aforementioned in 
ma coretrucbon using recombination of overlapping fragments. Aftemativefy. a synthetic V region repertoire 
(described hereinafter) may be used. 

A minilocus light chain transgene may be similarly constructed from the human X or k immunoglobulin locus. Con- 
struction of a k light chain mini-locus is very similar to construction of the heavy chain mini-locus, except that it is much 
simpler because of its smaller size and lower corrplexity. The human k locus contains only one constant region seg- 
ment; and this segment together with 5 1 and 3" enhancers, and all 5 of the functional J segments, can be isolated on a 
single 10 kb DNA fragment This fragment is co-injected together with a minilocus V region constructed as described 
tor the heavy chain minilocus. 

u n 1 ?"!^ example ' *" immunoglobulin heavy chain minilocus transgene construct eg., of about 75 kb encoding 
L I an ° COnstant re8,on sequences 080 °e formed from a plurality of DNA fragments, at least two, three or four of 
which each are ether a V region sequence, a D region sequence, a J and constant region sequence, a D and J and 
constant region sequence or a constant region sequence, with each sequence being substantially homologous to 
human gene sequences. Preferably, the sequences are operaWy finked to transcription regulatory sequencesand are 
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capable of undergoing rearrangement. With two or more appropriately placed constant region sequences (e.g., n and 
y) and switch regions, switch recombination also occurs. An exemplary light chain transgene construct s.m.larly formed 
from a plurality of DNA fragments, substantially homologous to human DNA and capable of undergoing rearrangement 
will include at least two. three or four DNA fragments, encoding V. D and constant regions, each fragment comprising 
either a V region sequence. J and constant region sequence or a constant region sequence. 

E Methods for Determining Funeral V Gene Sonmsnts and for Generating Synthetic V Segment Repertoire 

Of the various families of gene segments, i.e.. V. D, J and C region gene segments, the number of V gene segments 
generally far surpasses the number of corresponding gene segments for the D. J and C region gene segments. By anal- 
ogy to the rabbit system wherein a single V gene segments is utilized by approximately 90% of the antibodies produced 
(Knight and Becker (1 990). £ejl. 6Q, 963-970). it is possible to produce heavy and light transgenes containing a limited 
number of V region gene segments, and as few as one V region gene segments. Therefore, it is desirable to have a 
method to determine which V region gene segments are utilized by a particular organism, such as the human being, 
when mounting an immunoglobulin-mediated immune response. According to this approach, a single V gene segment 
when combining with the J or DJ gene segments is capable of providing sufficient diversity at CDR3 for the generation 
of a primary repertoire which upon somatic mutation is able to provide further diversity throughout the variable region, 
eg at CDR 1 and CDR2 for the production of high affinity antibodies. 

In this aspect of the invention, methods and vectors are provided for determining which V gene segments are com- 
monly utilized by an organism during an immune response. This method is based on determining which V segments 
are found in cDNA synthesized from B-cell poryA+ RNA. Such methods and vectors may also be used to faclrtate the 

construction of a synthetic V segment repertoire. . re ^ ar 

The outline of this strategy for identifying heavy chain V segments and for generating a synthetic V segment reper- 
toire is depicted in Fig.s 5 and 6. It is similarly applicable tor identifying light chain V segments with appropriate modrf.- 
cation The first step is the construction of a cloning vector. The preferred starting material is a DNA ragmerrt 
(approximately 2 Mb) containing an Prearranged V segment together with 5' and 3' flanking sequences. The fragment 
isdoned into a plasmid such as pGP1 or pGP2 described hereinafter containing a polylinker site flanked by the rare 
cutting restriction sites designated "W and "z" in the Figs. 5 and 6 (the polylinkers and restriction sites of pGP1 and 
DGP2 are described in the Examples). Oligonucleotide directed mutagenesis is then used to introduce two new restnc- 
tion sites V and V (generally each about 6 nucleotides in length). Restriction site "x" is placed approximately 20 
nucleotides from the 3' end of the intron between the signal and V segment exon. Restriction site "y" is placed approx- 
imately 20 nucleotides 3' of the V segment junction, within the 23 bp spacer between the heptamer and nonomer 
recombination signal sequences. Cutting the resulting plasmid with enzymes "x" and "y" removes the second exon (V 
segment), leaving the 5' flanking sequences, the V region promoter, the signal peptide exon. the intron, a gap f anked 
by "x" and "y" ends, the outside half of the recombination signal sequence, and the 3' flanking sequences. This plasmid 

' S ^The second step is the synthesis of four sets of oligonucleotide primers. P1 through P4. P1 and P2 are non-unique 
oligomers having approximately 50 nucleotides each which are used to prime double stranded cDNA synthesis. P1 
starts (going 5' to 3 1 ) with about 20 nucleotides of sequence homologous to the antisense strand of the recombination 
signal sequence in pVH1 (including the recognition sequence of restriction enzyme "y"). and continues with approxi- 
mately 30 nucleotides of antisense sequence hybridizing with about the last 30 nucleotides of the VH framework region 
3 (FR3) Random bases are incorporated over about the last 30 nucleotides so as to generate a set of primers that 
hybridize with all of the different VH families. The second oligonucleotide. P2, is in the sense orientation, and is homol- 
ogous to the approximately 50 nucleotides beginning with the restriction site "x" in pVH1 . This includes the V restric- 
tion site about the last 20 nucleotides of the intron. and about the f irst 30 nucleotides of FR1. Again, about the last 30 
nucleotides are non-unique so as to accommodate different VH region segments. Oligonucleotides P3 and P4 are 
homologous to about the first 20 nucleotides of P1 and P2 respectively. These oligos are unique so as to avoid intro- 
ducing new mutations into the V segments and are used to amplify double stranded cDNA by way of the polymerase 

chain reaction (PCR). . . . 4 . u . . ^ 

The 3' terminal portions of primers P1 and P2 which are capable of hybridizing to and priming the synthesis of the 
variable segments of the heavy or light immunoglobulin locus may be readily determined by one skilled in the art. For 
example the nucleotide sequence for a number of human VH genes have been published see e.g. Berman, J.E.. et al. 
(1988) EMBO J.. L 727-738 and Kabat, E.A., et al. (1987). Sequences of protfiin Of Immunological interests OS. 
Dept Health & Human Services, Washington, D.C. Similarly, when used to identify and/or generate V segments of the 
human light immunoglobulin locus, the appropriate 3' sequence portions of primers P1 and P2 may readily be deter- 
mined from published sequences. See e.g Kabat E.A.. et al.. suca In general, those nucleotide positions which are 
conserved amongst various V segments are also conserved in Ihe 3' portion of the P1 and P2 primers. For those nucle- 
otide positions wherein variation is observed amongst variable segments, such nucleotide positions in the correspond- 
ing P1 and P2 primers are similarly varied to provide P1 and P2 primers which comprise a pool of primers which are 
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capable of hybridizing to different VH or VL segments. 

The next step is to use these oligonucleotide primers to generate a library of human heavy-chain V-reoion cDNA 
P ^ 1P1 iS to f « C °NA syrthSs from hurna^l ^ 
RNA is basehydrolyzed, and second strand synthesis primed with P2. Full length, double stranded cONaTs Sen pun" 
f ,ed on aay1am.de gel. electroeluted. and used as template for polymerase dSn reactionjcm 2npHicW,S 

sssss^^ p t M z at *% cdna fe f iret synthesi2ed * -^^sjstss! 

usedasatemplateforthePI pnmed reactor. The amplified product (approximately 0.3 Kb) is then gel purified cleaved 
with restriction enzymes x" and y, and cloned into pHVI . 8 ^ 1 

, ari JL e „ r f ^f" 9 represente a genomic library of variable region segments and offers three 

^ over a conventual genomic library of variable segments. First, this library contains no pseudogenes wnBe 
ZTr° n Jl br ^r UW ^ "P to 5°* Pseudogene sequences. Second, the synthetic Zy S SISSS 
T C0 ^°^ al J*"* C0n1aininfl one functional V segment per 2 Kb of DNA. as opposed to ™ SSK 
•* TT* l6aV8S *• V 86flment P™**" sequences acceSSo XuE 
Such a cDNA library may be biased towards particular germline V segments because of differential expression 
The two sour<^ofb.asare:(0 differential rates ofV 

B^TSi ^ ^ d *• iS ^ "* in ^ *» fe 

P^T.»!SSE £ prono " nced in the immunoglobulin repertoire. Second, the semi-random primers 
2j£ Si * 0 n ,T° PO ° 16 ' 680,1 01 which se,ecBve, y cross-hybridizes with different V segment families Th2 

seXrT^l^ 10 9 i n ! ra,e ll° 6 Separate ,ibraries ' insurino that 8,1 «* *• V region families a?e rep^ 
sented. The second source of bras, differential selection of B-cell clones, is also dealt with in two analogous wavs First 

IZZ* J*,™** 8 ' indudes "**«■» fraction of antigen selected Emails is used. Lynph SSSSS 
^Snff l^ ^ 18 °. n8 ^ 01 UnS6lec,ed B - ce,ls - However . » may contafnThigh proportiS of 3 
S2f iTESr SeqUenCeS ,r ° m Pre " B " Ce " S - An0lh6r 80uree °* RNA is wh0,e Ninety pe^Srof circulaTg 
seS* XSS^ZZ e r eSSin9 H Cel, L and ^ reCSn1 00,16 m " row HoweSe? the level of 

RNA^S^l^ 5^ ^ ^ k ,RB ° n tte immUne 8,316 ° f 1,16 individual - T**"*"*' pofrAt 
RNA is crocked for selected B-cell sequences by northern blot hybridization with r specific probes If it is more oractical 

sssr D ri a high * b * b 01 *° sequences - ■ ■s- -s^hZrsss 

matT^t w .1m 5 °L o A 8yntheS ' S 18 pr,med *** 8001,1 a 40 constant-region axon 2 primer 

V?rnTT^ SyntheS6S is P rimed with P2. and a thirdTound of syrtheste 

pnmed with P1 . The cDNA from th.s th,rd round of synthesis provides the template for PCR amplification usingPSand 

tPch^f!^ 8016 'T" h8S b66n aenera,6d ' V 689menl8 used * erein may by identified by standard 
wSas 22X5? ?T2?» hy0ridiz8,ion ^ *"* *«* or segment Sic oligonudeoTd^ 
^'^f °y p CR methods. Such characterization of the V segment library provtoes information 
EZln£ qUen * ^ l^ 0 " 01 V S69ment ^ ationin aP^cular organism and asaco4^ 
m?e or^ZTv a ^f ^ 1^ I** e0n,-niclta rt *• ^ ^-genes of the invertioT^us ZTor 
™Z P 1 ? ? 9e " 6 se fl mente ma y 06 "sed in the above described mini-locus Iransgene construct Further 

tofSS^ 

to faalrtateKlentificaton of genomic fragments containing a particular desired V segment 

r^f^ Sy T^ V 8eflment reDert0ire ^structed by concatenation of the library sequences Large 
« 9 J ST 8 e " e „ tendem hundreds of copies of the injected sequence, are oom£nK ^nenS 

^! C Jr iCe - 71,686 tendem ^ are ^"y stable^owever. to ensured £5?5 
blocks of random DNA are prepared by digesting and then religating genomic DNA, so as to prevent the insertion^ 

m^^fSc TAn* 9881 P,0dUC68 ended fra9ment8 80 length of 64 nucleotides fW 

mente in the aze range of 50 to 100 nucleotides are eluted from an acrylamide gel. and religated The related DNA 

or Bglllsrte of the polylinker of the vector used to generate pVH1. 

The random sequence library is combined with the synthetic V segment library to create a synthetic V segment rec- 
ertojre. Inserts from the random sequence library are released with the enzymes V and Y ^ pur^IZS 
vector sequences. Inserts from the synthetic V segment library are isolated bycuttng with V and "z" BeforHSna 

V segment inserts, this DNA is treated with calf-intestinal phosphatase, to prevent self ligato. ThTv seWrS 
inserts are then hgated together with the random inserts to generate an ahernating tarxfem a^y ™ ri sin g a S 

^TITZ", ™ S ' i9ati0n ™T B fe Size Se,ected °" « suaose gradierl and the^S %E£ 
jected together wrth. fa example, a D-J^onstant miniMocus construct By directly injecting the synthetic V segmentX 
toZTZT ,rten :r 9d0nin9 ^ « is possible to take advantage of 5J fact L taS fa^TSS 
? ^ T» ,nSe ^ 31 8 an9,e ^ Ws 0886 ^ tandem ^ are not completely redurSant but leSto 
further drversrty. AJtematvdy. the synthetic V segment repertoire may be combined Sha D^SSS Mto torn, a 
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heavy chain transgene. . 

A synthetic light chain immunoglobulin segment repertoire may be similarly constructed using appropnate pnmers 

for the light chain locus. 
5 functional Disruption of End nnenous Immunoglobulin Loci 

The expression of successfully rearranged immunoglobulin W ,wn ^^ te r^ to .^^^ 
InarSeX ^pressing the rearrangement of the endogenous immunoglobulin genes <^<m»nortum 
a^li^irenanother way to generate a nonhuman that is devoid of endogenous antibodies is 
enSeno^ immunoglobulin loci. Using embryonic stem cell technology and homologous recombination the endog- 
£ZZ£?£ repertoire can be readily eliminated. The following «^^J2S^S!^ nS 
mouse immunoglobulin loci. The vectors and methods disclosed, however, can be read.ly adapted for use in other non 

^Bn^ technology involves the inaction of a gene, by homologous recomWnatton. in a plunder, I celHine 
thatf^^ 

ZlobS aene is introduced into the nuclei of embryonic stem cells. In a portion of the cells, the introduced DMA 
SSSS S»> copy of the mouse gene, replacing I wrth the altered copy. Celte contamins , the mew* 
eSneeS genetic lesionare injected into a host mouse embryo, which is reimplanted into a <^*™^™« 
mTeSyS de.elop into chimeric mice ,r«t possess germ cells ertirelyde^ 

by ZSte chimeric nice H is posstole to obtain a new line of mice containing the .ntroduced genetic les.cn 

K -light^nlod is sufficient. There arethree ways to disrurM each ofth«^ 
jSSL enhancer, and disruption of constant region coding sequences by the reduction of 
option is the most straightforward, in terms of DNA construct deagn. Elimination of the p ugene d.s ru P* B *^ u ' a 
preventing class switching to any of the functional heavy chain segments. The strategy for knocking out 

^TolmSe mo^" and k genes, targeting vectors are used based on the design employed by Jaenisch and 
^Z ^2T^7\ Nature 342, 435-438) for the successful disruption of the mouse ^-microglobulin 

"n2^ 

aene The pMCIneo insert uses a hybrid viral promoter/enhancer sequence to drive neo ^ a ° n ^™"^ 0 ^* ' s 
SJeJernbr^stem cells. Therefore, neo can be used as a selectable marker for integration of the 
Suet The HSV thymidine kinase (tk) gene is added to the end of the construct as a negative select™ marker against 

ing *e * the elimination of the J region. This region is fairly compact in the mouse, spanning only 

!s kb To^nsfruct a gene targeting vector, a 15 kb Kpnl fragment containing all of the secreted A ("MOO" 
IxoS'from^ouse genomic library is isolated. The 1.3 kb J region is replaced with the 1.1 kb insert from pMCIneo. The 
™£IESIE! to the S end of the Kpnl fragment. Correct integration of 

Tecombination. will result in the replacement of the mouse J H region with the neo.gene (F,g J) . Recon* jants are 
Seened by PCR. using a primer based on the neo gene and a primer homologous to mouse sequences 5 of the Kpnl 

8rte ^ernatiSe heavy-chain locus is knocked out by disrupting the coding region of the ^ gene. This ^pproach 
involve Z Se 1 5 kb K?nl fragment used in the previous approach. The 1 .1 kb insert from pMCInec ns a 
ur q uTtemH. site in exorTll. and the HSV tkgene added to the 3' Kpnl end. 

the neo insert, that eliminate the tkgene, are then selected for. These are ^^^^SSSm- 
PCR amplification. One of the PCR primers is derived from neo sequences and the other from mous e 
side .5 »e targeting vector. The functional disruption of the mouse immunoglobulin loci is presented m the Examples. 

Mor,.n.nti an Anim ?* mntaininn Rearranged Immunoglobulin Heavy and Light Transgepes 

A oremise underlying the previously discussed transgenic animals containing prearranged mini-tocus Ig trans- 
oene^rS is possible to generate a complete antibody repertoire without including all of the ^ 
meTs *ZH n th^tural immunoglobulin locus. Theoretically, it is possible to reduce the ^"*nrt 
siuences that contribute to the primary repertoire without reducing the secondary repertoire. As long a there ,s 
•Z*d»X" the primary repertoire to trigger a T-ce.1 dependent response for any given antigen, somatic hyper- 
mutation should be capable of delivering a high affinity antibody against that antigen. ^ 

tS Snce* isSen a step funWin this aspect of the invention wherein a full M?^ h '^£"£" 
is genMM iely by somatic mutation. The antigen combining site to created by the interface between the amino- 
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terminal heavy chain domain and the amino-terminal light chain domain. The CDR1, 2 and 3 residues within each of 
these domains that interact with the antigen are located on three different loops that connect p strands. As previously 
described, these regions have the greatest sequence diversity between different antibody molecules recognizing differ- 
ent antigens. Thus, the antibody repertoire is determined by sequence diversity at CDR1 f 2, and 3. The diversity at 
CDR1, 2, and 3 that gives rise to a complete antibody repertoire comes from three sources: recombinational diversity 
junctional diversity, and somatic mutatioa Recombinational diversity at CDR1 and 2 comes from the choice of different 
V segments containing different CDR1 and 2 sequences. Recombinational diversity at CDR 3 comes from the choice 
of different D and J segments. Junctional diversity contributes only to CDR3 diversity, while somatic mutation, acting 
across the entire V region, contrfcutes to diversity at all three complimentarity determining regions. Recombinational 
and junctional diversity together constitute the diversity of the primary repertoire (Fig. 1). Thus VDJ joining generates a 
set of IgM expressing primary B-ceJIs. 

Any primary repertoire B-cell that expresses a cell surface IgM molecule with a certain minimal affinity for a foreign 
antigen, internalizes that antigen as IgM and cycle off the cell surface. The antigen is then processed and associated 
peptides are presented on the cell surface by class II MHC molecules. If enough foreign antigen is presented at the cell 
surface this, triggers a T-cell response that in turn triggers the T-cell dependent maturation of the B<eil. This is the so- 
called secondary response (Fig. 8}. Part of this response involves the hypermutation of the variable portion of the immu- 
noglobulin genes. Thus a B-cell clone undergoing a secondary response constantly gives rise to new clones with 
altered immunoglobulin molecules. Those clones with higher affinities for the foreign antigen are selectively expanded 
by helper T-cells, giving rise to affinity maturation of the expressed antibody. Because somatic hypermutation takes 
place across the entire V region, there is no theoretical limit to the process of affinity maturation. 

In this aspect of the invention, CDR1 and 2 diversity is not necessary for generating a complete antibody response. 
Rather, diversity at CDR3. created by VJ and VDJ joining provides sufficient minimal affinity to trigger the T-cell depend- 
ent maturation to give rise to high affinity antibodies for a large number of different antigens. Thus, methods and trans- 
genic animals are provided for generating a broad antibody repertoire without primary diversity. Such diversity relies on 
somatic mutation for the generation of antibody diversity. During the process of affinity maturation, somatic mutation 
gives rise to a large number of clones with lower, rather than higher, affinities for the stimulating antigen. Most of these 
clones are not selected for and die off. However, if one of these clones has affinity for a new antigen that is also present, 
this clone expands and undergoes affinity maturation for the new antigen (Fig. 9). In this aspect of the invention, a trans- 
genic non-human animal, such as a mouse, with rearranged human heavy and light chains combine to form an antibody 
that has a low affinity for a known antigen, ff this animal is injected with the known antigen, its B-cells undergo a sec- 
ondary response leading to the production of high affinity antibodies for that antigen. However, if this mouse is first 
injected with a mixture of the known antigen and a new antigen, and then subsequently challenged with the new antigen 
alone, high affinity antibodies against the new antigen are produced by the branching process described above. This 
approach has two major advantages: first the transgene constructs are easy to generate; and second, the rearranged 
transgenes are capable of allelidy and isotypically excluding the rearrangement of the endogenous mouse genes, thus 
making it unnecessary to eliminate those genes by homologous recombination as previously described. 

The first step in this embodiment of the invention is the isolation of rearranged heavy and light chain genes from a 
human hybridoma that expresses an IgM antibody directed against a known antigen. The ideal hybridoma recognizes 
a readily available antigen that is capable of generating a good mouse T-cell response. There are a number of such 
human hybridomas in existence, including several that react with promising antigens such as tetanus toxoid, pseu- 
domonas, or gram negative bacteria (reviewed by James and Bourla (1987), J. Immunol. Methods 1QQ, 5-40). The 
entire rearranged heavy chain gene is isolated on a single piece of DNA (approximately 20 kb) while the rearranged k 
light chain gene, including the 3" enhancer, is isolated on a second DNA fragment (about 20 kb). Each of these frag- 
ments are pieced together from clones isolated from a phage X library made from DNA isolated from the hybridoma. 
Two constructs are generated, a heavy chain construct and a light chain construct. 

The heavy chain construct (Fig. 1 0) consists of the 20 kb hybridoma fragment, containing the rearranged IgM gene, 
ligated to a 25 kb fragment that contains the human y3 and y1 constant regions followed by a 700 bp fragment contain- 
ing the rat heavy chain 3' enhancer (Pettersson, et al. (1 990). Nature. 2M. 165-168). The light chain construct consists 
of the intact 20 kb piece of DNA containing the rearranged k chain and 3' enhancer. These two constructs are coin- 
jected so that they are integrated at a single site in the mouse genome. Transgenic mice are tested by Northern blot 
analysis for expression of the transgene mRNA. FACS analysis is then carriedout on tail blood samples to detect cell 
surface expression of the transgene encoded protein. Mice are then immunized with the antigen recognized by the orig- 
inal hybridoma. ELISA and FACS analysis are carried out on tail blood to detect class switching. Finally, the mice are 
tested for their ability to respond to a number of different antigens by co-injecting a panel of antigens together with the 
original antigen. Tail Wood are analyzed by ELISA to detect the production of high affinity human IgQ antibodies 
directed against individual antigens. 

To use this transgenic mouse to generate human antfcodies directed against a given antigen, that antigen prefera- 
bly is first coinjected together with the antigen associated with the hybridoma from Which the genes were isolated. This 
hybridoma associated antigen is referred to as the co-antigen (sometimes as a second antigen), and the new antigen 
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simply as the antigen (or first antigen). If possible, the second antigen is chemically cross-linked to the first antigen prior 
to injection This causes the first antigen to be internalized and presented by the primary transgene presenting B-cells, 
thus ensuring the existence of a pool of activated helper T-cells that recognize the first antigen. A typical immunization 
schedule is as follows. Day 1 : Mice are injected ip with first antigen mixed with, or cross-linked to. second antigen in 
complete Freunds adjuvant. Day 14: first antigen (without second antigen) is injected ip in incomplete Freunds adjuvant. 
Day 35- repeat injection with first antigen in incomplete Freunds. Day 45: Test for antibody response by ELISA on tail 
blood samples. Day 56: repeat injection of good responders with antigen in incomplete Freunds. Day 59: Fuse spleens 
of good respond ers. 

In an alternate aspect of this invention, the antigen recognized by the hybridomafrom which the Ig genes were iso- 
lated is used as an immunogen. New transgenic hybridomas are then isolated from the immunized animal that express 
somatically mutated versions of the original antibody. These new antibodies will have a higher affinity for the original 
antigen This antibody "sharpening" procedure can also be applied to antibody genes generated by CDR grafting (EP. 
Pub. No. 239400, published Sept. 30, 1987) or isolated from bacterial (W.D. Huse et al. (1989) Science , 246, 1275) or 
phage (T. Clackson eft al. (1991) Nature 252. 624) expression libraries. 

Transgenic Non-Human Animate Containing R e arranged and Unrearranged Immunoglobulin HeflVY nlTdfor light TranS- 
qene 

The previous embodiments described the use of fully rearranged or fully unrearranged heavy and light immu- 
noglobulin transgenes to produce transgenic non-human animals capable of producing a heterologous antibody. In a 
further aspect of the invention, transgenic animals contain at least one rearranged and at least one unrearranged immu- 
noglobulin transgene are produced by utilizing any of the aforementioned unrearranged and rearranged transgenes in 
combination to provide heavy and light transgenes in the transgenic animal. In this regard, the unrearranged transgene 
may comprise a heavy or light genomic or mini-locus transgene construct with the rearranged transgene comprising an 
appropriate rearranged transgene. For example, if a unrearranged mini-locus light chain transgene is used, the appro- 
priate other transgene is a fully rearranged heavy chain transgene. It is preferred, however, that the rearranged trans- 
gene comprise a rearranged immunoglobulin light chain transgene and that the unrearranged transgene comprise an 
immunoglobulin heavy chain genomic or mini-locus transgene, most preferably an unrearranged heavy chain transgene 
with associated A and y constant regions. , u - *u 

The combination of rearranged and unrearranged transgene provides an intermediate level of diversity within the 
primary repertoire B-cells. Thus, although primary diversity at CD1 , CD2 and CD3 in the rearranged transgene is fixed 
in the primary repertoire B-cell, the primary diversity at the CDR1 , CDR2 and CDR3 produced by the rearrangement of 
the unrearranged transgene provides a population of primary repertoire of B-cells having greater potential diversity than 
the B-cell clone obtained when rearranged heavy and light transgenes are used. Such primary diversity provides broad- 
ened secondary diversity when such cells respond to foreign antigen by way of somatic mutation. 

Nucleic Acids 

The nucleic acids, the term "substantial homology" indicates that two nucleic acids, or designated sequences 
thereof when optimally aligned and compared, are identical, with appropriate nucleotide insertions or deletions, in at 
least about 80% of the nucleotides, usually at least about 90% to 95%, and more preferably at least about 98 to 99.5% 
of the nucleotides. Alternatively, substantial homology exists when the segments will hybridize under selective hybridi- 
zation conditions, to the complement of the strand. The nucleic acids may be present in whole cells, in a cell lysate, or 
in a partially purified or substantially pure form. A nucleic acid is Isolated" or "rendered substantially pure" when puri- 
fied away from other cellular components or other contaminants, e.g., other cellular nucleic acids or proteins, by stand- 
ard techniques, including alkaline/SDS treatment. CsCI banding, column chromatography, agarose gel electrophoresis 
and others well known in the art. Sfifi, F. Ausubel, et al., ed. Ourrent Protocols in Molecular Biology, Greene Publishing 
and Wiley-lnterscience, New York (1987). 

The nucleic acid compositions of the present invention, while often in a native sequence (except for modified 
restriction sites and the like), from either cDNA, genomic or mixtures may be mutated, thereof in accordance with stand- 
ard techniques to provide gene sequences. For coding sequences, these mutations, may affect amino acid sequence 
as desired In particular, DNA sequences substantially homologous to or derived from native V, D, J t constant, switches 
and other such sequences described herein are contemplated (where "derived" indicates that a sequence is identical 
or modified from another sequence). . -w 

A nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid 
sequence. For instance, a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of 
the sequence. With respect to transcription regulatory sequences, operably linked means that the DNA sequences 
being linked are contiguous and, where necessary to join two protein coding regions, contiguous and in reading frame. 
For switch sequences, operably linked indicates that the sequences are capable of effecting switch recombination. 
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Specific Preferred Embodiments 

A preferred embodiment of the invention is an animal containing a single copy of the transgene described in Exam- 
ple 14 (pHC2) bred with an animal containing a single copy of the transgene described in Example 16, and the offspring 
bred with the JH deleted animal described in Examples 9 and 12. Animals are bred to homozygosity for each of these 
three traits. Such animals have the following genotype: a single copy (per hapioid set of chromosomes) of a human 
heavy chain unrearranged mini-locus (described in Example 14). a single copy (per hapioid set of chromosomes) of a 
rearranged human k light chain construct (described in Example 16), and a deletion at each endogenous mouse heavy 
chain locus that removes all of the functional JH segments (described in Examples 9 and 12). Such animals are bred 
with mice that are homozygous for the deletion of the JH segments (Examples 9 and 12) to produce offspring that are 
homozygous for the JH deletion and hemizygous for the human heavy and light chain constructs. The resultant animate 
are injected with antigens and used for production of human monoclonal antibodies against these antigens. 

B cells isolated from such an animal are monospecific with regards to the human heavy and light chains because 
they contain only a single copy of each gene. Furthermore, they will be monospecific with regards to human or mouse 
heavy chains because both endogenous mouse heavy chain gene copies are nonfunctional by virtue of the deletion 
spanning the JH region introduced as described in Example 9 and 12. Furthermore, a substantial fraction of the B cells 
will be monospecific with regards to the human or mouse light chains because expression of the single copy of the rear- 
ranged human k light chain gene will alielically and isotypically exclude the rearrangement of the endogenous mouse 
k and lambda chain genes in a significant fraction of B-cells. 

The transgenic mouse of the preferred embodiment will exhibit immunoglobulin production with a significant reper- 
toire, ideally substantially similar to that of a native mouse. Thus, for example, when the endogenous Ig genes have 
been inactivated, the total immunoglobulin levels will range from about 0.1 to 10 mg/ml of serum, preferably 0 5 to 5 
mg/ml. ideally at least about 1 .0 mg/ml. When a transgene capable of effecting a switch to IgG from IgM has been intro- 
duced into the transgenic mouse, the adult mouse ratio of serum IgG to IgM is preferably about 10:1. Of course, the IgG 
to IgM ratio will be much lower in the immature mouse. In general, greater than about 10%, preferably 40 to 80% of the 
spleen and lymph node B cells express exclusively human IgG protein. 

The repertoire will ideally approximate that shown in a non-transgenic mouse, usually at least about 10% as high, 
preferably 25 to 50% or more. Generally, at least about a thousand different immunoglobulins (ideally IgG), preferably 
10 to 10 or more, will be produced, depending primarily on the number of different V J and D regions introduced into 
the mouse genome. These immunoglobulins will typically recognize about one-half or more of highly antigenic proteins 
including, but not limited to: pigeon cytochrome C, chicken lysozyme, ppkeweed mitogen, bovine serum albumin, key- 
hole hmpit hemocyanin, influenza hemagglutinin, staphylococcus protein A. sperm whale myoglobin, influenza neu- 
raminidase, and lambda repressor protein. Some of the immunoglobulins will exhibit an affinity for preselected antigens 
of at least about 10- 7 M* 1 . preferably 10* 8 M 1 to 10* 9 M* 1 or greater. 

Although the foregoing describes a preferred embodiment of the transgenic animal of the invention, other embodi- 
ments are defined by the disclosure herein and more particularly by the transgenes described in the Examples. Four 
categories of transgenic animal may be defined: 

I. Transgenic animals containing an unrearranged heavy and rearranged light immunoglobulin transgene. 

II. Transgenic animals containing an unrearranged heavy and unrearranged light immunoglobulin transgene 
ML Transgenic animal containing rearranged heavy and an unrearranged light immunoglobulin transgene, and 
IV. Transgenic animals containing rearranged heavy and rearranged light immunoglobulin transgenes. 

Of these categories of transgenic animal, the preferred order of preference is as follows I > II > III > IV. 
45 Within each of these categories of the transgenic animal, a number of possible combinations are preferred. Such 
preferred embodiments comprise the following: 

Category I 

so (a) Example 1 and 2 or 1 9 and 20 animal bred with Example 7 or 1 6 animal. 

(b) Example 1 or 19 fragment coinjected with Example 7 or 16 fragment 

(c) Example 5 (H, I or J) or 14, 17 or 21 animal bred with Example 7 or 16 animal. 

(d) Example 5(H) or 1 4 construct coinjected with Example 7 or 1 6 construct. 

(e) All of the above bred with the animal of Example 9 or 1 1. 12 or 13. Particularly preferred embodiments are all 
55 of the above bred the with animal of Example 9 or 1 2 or 13. 

Catsgpry II 

(a) Example 1. 2. 19 or 20 animal bred with Example 6, 3, 4, 16, 22 or 23 animal. 
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(b) Fragment in Example 1 or 19 coinjected with fragment in Example 2 or 20. 

(c) Example 5 (H, I or J) or 1 4, 1 7 or 21 animal bred with Example 6(B, C or D) or 1 6 animal. 

(d) Construct 5(H) or 14 coinjected with construct 6(B) or 16. 

(e) Animal of Example 1 , 2, 1 9 or 20 bred with animal of Example 6(B, C or D) or 1 6. 

(f) Animal of Example 3, 4, 22 or 23 bred with animal of Example 5(H. I or J) or 1 4, 1 7 or 21 . 

(g) All of the above bred with animal of Example 9, 10, 1 1 . 12 or 13. 

Category III 

(a) Example 3, 4, 22 or 23 animal bred with Example 8 or 1 5 animal. 

(b) Example 3 or 23 fragment coinjected with Example 8 or 1 5 fragment. 

(c) Example 6{B, C or D) or 1 6 animal bred with Example 8 or 1 5 animal. 

(d) Example 6(B) or 15 construct coinjected with Example 8 or 15 construct. 

(e) All of the above bred with animal of Example 9 to 1 3. 

Category IV 

(a) Animal of Example 7 or 1 6, bred with animal of Example 8 or 1 5. 

(b) Construct of Example 7 or 1 6 coinjected with construct of Example 8 or 1 5. 

(c) All of the above bred with animal of Example 9 to 1 3. 

The following is presented by way of example and is not to be construed as a limitation to the scope of the claims. 



METHODS AND MATERIALS 

Transgenic mice are derived according to Hogan, et aL, "Manipulating the Mouse Embryo: A Laboratory Manual", 
Cold Spring Harbor Laboratory. . 

Embryonic stem cells are manipulated according to published procedures (Teratocarcinomas and embryonic stem 
cells: a practical approach, E.J. Robertson, ed. ( IRL Press, Washington, D.C., 1987; Zjilstra, et at. (1989), Ngmifi, 342, 
435-438; and Schwartzberg, P., et al. (1989), Science, 246, 799-803). 

DNA cloning procedures are carried out according to J. Sarrfcrook, et al. in Molecular cloning: A Laboratory Man- 
ual 2d ed., 1989, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y 

' Oligonucleotides are synthesized on an Applied Bio Systems oligonucleotide synthesizer according to specifica- 
tions provided by the manufacturer. 

Hybridoma cells and antibodies are manipulated according to "Antibodies: A Laboratory Manual , Ed Harlow and 
David Lane, Cold Spring Harbor Laboratory (1988). 



EXAMPLE 1 



Genomic Hea vy Chain Human Iq Transaene 

This Example describes the cloning and microinjection of a human genomic heavy chain immunoglobulin trans- 
gene which is microinjected into a murine zygote. 

Nuclei are isolated from fresh human placental tissue as described by Marzluff, W.F., et al. (1985), Transcription 
and Translation: A Practical Approach", B.D. Hammes and S.J. Higgins. eds., pp. 89-129, IRL Press. Oxford). The iso- 
lated nuclei (or PBS washed human spermatocytes) are embedded in a low melting point agarose matrix and lysed with 
EDTA and proteinase k to expose high molecular weight DNA, which is then digested in the agarose with the restriction 
enzyme Notl as described by M. Finney in Current Protocols in Molecular Biology (F. Ausubel, et al., eds. John Wiley & 
Sons, Supp. 4, 1988, Section 2.5.1). 

The Notl digested DNA is then fractionated by pulsed field gel electrophoresis as described by Anand, R., et al. 
(1 989) Nud Acids Res.. IX 3425-3433. Fractions enriched for the Notl fragment are assayed by Southern hybridiza- 
tion to detect one or more of the sequences encoded by this fragment. Such sequences include the heavy chain D seg- 
ments J segments, ^ and y1 constant regions together with representatives of all 6 VH families (although this fragment 
is identified as 670 kb fragment from HeLa cells by Berman, et al. (1988), supra., we have found it to be as 830 kb frag- 
ment from human placental an sperm DNA). Those fractions containing this Notl fragment (see Fig. 4) are pooled and 
cloned into the Notl site of the vector pYACNN in Yeast cells. Plasmid pYACNN is prepared by digestion of pYAC-4 Neo 
(Cook, H., et al. (1988). Nucleic Acids Res.. 1£, 1 1817) with EcoRI and ligation in the presence of the oligonucleotide 
5-AATTGCGGCCGC-3\ ■ . . , H ^ o 

YAC clones containing the heavy chain Notl fragment are isolated as described by Brownstein, et al. (1989), Set 
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ence, 244, 1348-1351. and Green, E.. et aJ. (1990), Proa Natl. Acad. Sri USA §1 1213-1217. The cloned Notl insert 
is isolated from high molecular weight yeast DNA by pulse field gel electrophoresis as descrfoed by M. Finney, opcit 
The DNA is condensed by the addition of 1 mM spermine and microinjected directly into the nucleus of single ceil 
embryos previously described. 

EXAMPLE 2 

Discontinuous Genomic Heavy Chain la TransaenA 

A 1 10 kb Spel fragment of human genomic DNA containing VH6, D segments, J segments, the \i constant region 
and part of the y constant region (see Fig. 4) is isolated by YAC cloning as described in Example 1 . 

A 570 kb Notl fragment upstream of the 670-830 kb Notl fragment described above containing multiple copies of 
VI through V5 is isolated as described. (Berman, et al. (1988). supra detected two 570 kb Noil fragments. Each of those 
contain multiple V segments.) 

The two fragments are coinjected into the nucleus of a mouse single cell embryo as described in Exarrple 1 . 

Coinjection of two different DNA fragments will usually result in the integration of both fragments at the same inser- 
tion site within the chromosome. Therefore, approximately 50% of the resulting transgenic animals that contain at least 
one copy of each of the two fragments will have the V segment fragment inserted upstream of the constant region con- 
taining fragment Of these animals. 50% will carry out V to DJ joining by DNA inversion and 50% by deletion, depending 
on the orientation of the 570 kb Notl fragment relative to the position of the 1 10 kb Spel fragment DNA is isolated from 
resultant transgenic animals and those animals found to be containing both transgenes by Southern blot hybridization 
(specifically, those animals containing both multiple human V segments and human constant region genes) are tested 
for their ability to express human immunoglobulin molecules. 

EXAMPLE 3 

Genomic K Li ght Chain Human Iq Transaene Formed bv In Vivo Homologous Recombination 

A map of the human k light chain has been described in Lorenz, W.. et al. (1987), Nucl. Acids Res is, 9667-9677 
and is depicted in Fig. 1 1 . 

A 450 kb Xhol to Notl fragment that includes all of Ck. the 3* enhancer, all J segments, and at least five different V 
segments (a) is isolated and microinjected into the nucleus of single cell embryos as described in Example 1. 

EXAMPLE 4 

Genomic k Light Chain Human lo Transa ene Formed by In Vivo Homologous Recombinatinn 

A 750 kb Mlul to Notl fragment that includes all of the above plus at least 20 more V segments (b) is isolated as 
described in Example 1 (see Fig. 1 1) and digested with BssHII to produce a fragment of about 400 kb (c) . 

The 450 kb Xhol to Notl fragment (a) plus the approximately 400 kb Mlul to BssHII fragment (c) have sequence 
overlap defined by the BssHII and Xhol restriction sites shown in Fig. 1 1 . Homologous recombination of these two frag- 
ments upon microinjection of a mouse zygote results in a transgene containing at (east an additional 1 5-20 V segments 
over that found in the 450 kb Xhol/Notl fragment (Example 3). 

EXAMPLE 5 

Construction of Heavy Chain Mini-Locus 
A. Construction of pGPl and dGP2 

pBR322 is digested with EcoRI and Styl and ligated with the following oligonucleotides to generate pGP1 which 
contains a 1 47 base pair insert containing the restriction sites shown in Fig. 1 3. The general overlapping of these oligos 
is also shown in Fig. 13. 

Trie oligonucleotides are: 
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oligo-1 



5 ' 



CTT GAG CCC GCC TAA TGA GCG GGC TTT 
TTT TTG CAT ACT GCG GCC - 3' 



oligo-2 



5 



GCA ATG GCC TGG ATC CAT GGC GCG CTA 
GCA TCG ATA TCT AGA GCT CGA GCA -3 • 



oligo-3 



TGC AGA TCT GAA TTC CCG GGT ACC AAG 
CTT ACG CGT ACT AGT GCG GCC GCT -3 1 



oligo-4 



AAT TAG CGG CCG CAC TAG TAC GCG TAA 
GCT TGG TAC CCG GGA ATT - 3 1 



oligo-5 



CAG ATC TGC ATG CTC GAG CTC TAG ATA 
TCG ATG CTA GCG CGC CAT GGA TCC - 3' 



oligo-6 



AGG CCA TTG CGG CCG CAG TAT GCA AAA 
AAA AGC CCG CTC ATT AGG CGG GCT - 3 1 



This plasmid contains a large polylinker flanked by rare cutting Notl sites for building large inserts that can be iso- 
lated from vector sequences for microinjection. The plasmid is based on pBR322 which is relatively low copy compared 
to the pUC based plasmids (pGP1 retains the pBR322 copy number control region near the origin of replication). Low 
copy number reduces the potential toxicity of insert sequences. In addition, pGP1 contains a strong transcription termi- 
nator sequence derived from trpA (Christie, G.E., et al. (1981), Prop Natl. Arad, Sci. USA) inserted between the amp- 
iciHin resistance gene and the polylinker. This further reduces the toxicity associated with certain inserts by preventing 
readthrough transcription coming from the ampicillin promoters. 

Plasmid pGP2 is derived from pGP1 to introduce an additional restriction site (Sfil) in the polylinker. pGP1 is 
digested with Mlul and Spel to cut the recognition sequences in the polylinker portion of the plasmid. 

The following adapter oligonucleotides are ligated to the thus digested pGP1 to form pGP2. 



pGP2 is identical to pGP1 except that it contains an additional Sfi I site located between the Mlul and Spel sites. 
This allows inserts to be completely excised with Sfil as well as with Notl. 

B. Construction of dRE3 (rat enhancer 3^ 

An enhancer sequence located downstream of the rat constant region is included in the heavy chain constructs. 
The heavy chain region 3' enhancer described by S. Pettersson, et al. (1990), Nature, 344, 165-168) is isolated and 
cloned. The rat IGH 3' enhancer sequence is PCR amplified by using the following oligonucleotides: 

5« CAG GAT CCA GAT ATC AGT ACC TGA AAC AGG GCT TGC 3' 
5' GAG CAT GCA CAG GAC CTG GAG CAC ACA CAG CCT TCC 3« 



The thus formed double stranded DNA encoding the 3' enhancer is cut with- BamHI and Sphl and clone into 
BamHI/Sphl cut pGP2 to yield pRE3 (rat enhancer 3% 



5' CGC GTG GCC GCA ATG GCC A 3' 
5 ' CTA GTG GCC ATT GCG GCC A 3 ' 
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C. Cloning of Human J- M Fteginn 

A substantial portion of this region is cloned by combining two or more fragments isolated from phage lambda 
inserts. See Fig. 14. 

A 6.3 kb BamHI/Hindill fragment that includes all human J segments (Matsuda, et al. (1988). EMBO J.. £ 1047-- 
1051 ; Ravetech, et al. (1981). C§H, 27, 583-591) is isolated from human genomic DNA Itorary using the oligonucleotide 
GGA CTG TGT CCC TGT GTG ATG CTT TTG ATG TCT GGG GCC AAG. 

An adjacent 10 kb Hindlll/Bamll fragment that contains enhancer, switch and constant region coding exons Ofesui, 
et al. (1989). Bur, J. [mmynol,, ja 1399-1403) is similarly isolated using the oligonucleotide* 
CAC CAA GTT GAC CTG CCT GGT CAC AGA CCT G AC C AC CTA TGA 

An adjacent 3' 1.5 kb BamHI fragment is similarly isolated using clone pMUM insert as probe (pMUM is 4 kb 
EcoRl/Hindlll fragment isolated from human genomic DNA library with oligonucleotide: 

CCT GTG GAC CAC CGC CTC CAC CTT CAT 
CGT CCT CTT CCT CCT 



mu membrane exon 1) and cloned into pUCl9. 

pGP1 is digested with BamHI and Bglll followed by treatment with calf intestinal alkaline phosphatase. 

Fragments (a) and (b) from Fig. 14 are cloned in the digested pGP1 . A clone is then isolated which is oriented such 
that 5' BamHI site is destroyed by BamHI/Bgl fusion. It is identified as pMU (see Fig. 15). pMU is digested with BamHI 
and fragment (c) from Fig. 14 is inserted. The orientation is checked with Hindlll digest. The resultant plasmid pHIG1 
(Fig. 15) contains an 18 kb insert encocfing J and Cji segments. 

D. Cloning of Cti Region 

pGP1 is digested with BamHI and Hindlll is followed by treatment with calf intestinal alkaline phosphatase (Fig. 14). 
The so treated fragment (b) of Fig. 14 and fragment (c) of Fig. 14 are cloned into the BamHi/Hindlil cut pGP1 . Proper 
orientation of fragment (c) is checked by Hindlll digestion to form pCON1 containing a 12 kb insert encoding the Cu 
region. . 

Whereas pHIG1 contains J segments, switch and n sequences in its 18 kb insert with an SHI 3' site and a Spel 5' 
site in a polylinker flanked by Notl sites, will be used for rearranged VDJ segments. pCON1 is identical except that it 
lacks the J region and contains only a 12 kb insert The use of pCON1 in the construction of fragment containing rear- 
ranged VDJ segments will be described hereinafter. 

E. Cloning of v-1 Constant Re gion foREG2^ 

The cloning of the human -^1 region is depicted in Fig. 16. 

Yamamura. et al. (1986). Proc. Natl. Acad. Sci. IJSA 83. 2152-2156 reported the expression of membrane bound 
human T 1 from a transgene construct that had been partially deleted on integration. Their results indicate that the 3' 
BamHI site delineates a sequence that includes the transmembrane rearranged and switched copy of the gamma gene 
with a V-C intron of less than 5kb. Therefore, in the unrearranged, unswitched gene, the entire switch region is included 
in a sequence beginning less than 5 kb from the 5' end of the first r 1 constant exon. Therefore it is included in the 5' 
5.3 kb Hindlll fragment (Ellison, J.W., et al. (1982). Nudeic Acids Ras i& 4071^4079). Takahashi, et al. (1982). £g|L 
2& 671-679 also reports that this fragment contains the switch sequence, and this fragment together with the 7.7 kb 
Hindlll to BamHI fragment must include all of the sequences we need for the transgene construct. 

Phage clones containing the y- 1 region are identified and isolated using the following oligonucleotide which is spe- 
cific for the third exon of y-i (CH3). 

5* TGA GCC ACG AAG ACC CTG AGG 

TCA AGT TCA ACT GGT ACG TGG 3 • 



A 7.7 kb Hindlll to Bglll fragment (fragment (a) in Fig. 16) is cloned into Hindlll/Bglll cut pRE3 to form pREGl . The 
upstream 5.3 kb Hindlll fragment (fragment (b) in Fig. 16) is cloned into Hindlll digested pREGl to form pREG2. Correct 
orientation is confirmed by BamHI/Spel digestion. 
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F. Combining Cy and Cu 

The previously described plasmid pHIQ1 contains human J segments and the C\l constant region exons. To pro- 
vide a transgene containing the C\i constant region gene segments, pHIG1 was digested with Sfil (Fig. 15). The plas- 
5 mid pREG2 was also digested with Sfil to produce a 1 3.5 kb insert containing human Cy exons and the rat 3' enhancer 
sequence. These sequences were combined to produce the plasmid pHIG3* (Fig. 17) containing the human J seg- 
ments, the human Cu constant region, the human Cy1 constant region and the rat 3' enhancer contained on a 31 .5 kb 

insert. . 0/ _ . . . . ■ 

A second plasmid encoding human Cu and human Cy1 without J segments is constructed by digest ng pCON1 with 
io Sfil and combining that with the Sfil fragment containing the human Cy region and the rat 3' enhancer by digesting 
pREG2 with Sfil. The resultant plasmid, pCON (Fig. 1 7) contains a 26 kb Notl/Spel insert containing human Cji, human 
y1 and the rat 3' enhancer sequence. 
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Q Cloning of D Segment 

The strategy for cloning the human D segments is depicted in Fig. 18. Phage clones from the human genomic 
library containing D segments are identified and isolated using probes specific for diversity region sequences (Y. Ichi- 
hara, et al. (1988), EM BO J .. Z 4141 -4150). The following oligonucleotides are used: 

DXP1: 5' - TGG TAT TAC TAT GGT TCG GGG AGT TAT TAT 

AAC CAC AGT GTC - 3 1 

DXP4: 5 1 - GCC TGA AAT GGA GCC TCA GGG CAC AGT GGG 

CAC GGA CAC TGT - 3' 

DN4 : 5* - GCA GGG AGG ACA TGT TTA GGA TCT GAG GCC 

GCA CCT GAC ACC - 3* ■ 



A 5.2 kb Xhol fragment (fragment (b) in Fig. 1 8) containing DLR1 , DXP1, DXP'1 , and DA1 is isolated from a phage 
clone identified with oligo DXP1 . 

A 3.2 kb Xbal fragment (fragment (c) in Fig. 18) containing DXP4, DA4 and DK4 is isolated from a phage clone 

35 identified with oligo DXP4. 

Fragments (b), (c) and (d) from Fig. 18 are combined and cloned into the Xbal/Xhol site of pGP1 to form pHK32 

which contains a 10.6 kb insert 

This cloning is performed sequentially. First, the 5.2 kb fragment (b) in Fig. 18 and the 2.2 kb fragment (d) of Fig. 
18 are treated with calf intestinal alkaline phosphatase and cloned into pGP1 digested with Xhol and Xbal. The result- 
40 ant clones are screened with the 5.2 and 2.2 kb insert. Half of those clones testing positive with the 5.2 and 2.2 kb 
inserts have the 5.2 kb insert in the proper orientation as determined by BamHI digestion. The 3.2 kb Xbal fragment 
from Fig. 18 is then cloned into this intermediate plasmid containing fragments (b) and (d) to form pHIG2 (Fig. 9). This 
plasmid contains diversity segments cloned into the polylinker with a unique 5' Sfil site and unique 3* Spel site. The 
entire polylinker is flanked by Notl sites. 



45 



H. Constructio n of Heavy Chain Minilocus 

The following describes the construction of a human heavy chain mini-locus which contain one or more V seg- 
ments. , . , , 
so An unrearranged V segment corresponding to that identified as the V segment contained in the hybndoma of 
Newkirk, et al. (1988), J. Clin. Invest.. S1 1511-1518, is isolated using the following oligonucleotide: 

5' - GAT CCT GGT TTA GTT AAA GAG GAT TTT 
ATT CAC CCC TGT GTC - 3 ' 

55 



A restriction map of the unrearranged V segment is determined to identify unique restriction sites which provide 
upon digestion a DNA fragment having a length approximately 2 kb containing the unrearranged V segment together 
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with 5* and 3' flanking sequences. The 5" prime sequences will include promoter and other regulatory sequences 
whereas the 3' flanking sequence provides recombination sequences necessary for V-DJ joining. This approximately 
3.0 kb V segment insert is cloned into the polylinker of pGB2 to form pVHI . 

pVH1 is digested with Sfil and the resultant fragment is doned into the Sfil ate of pHIG2 to form a pHK35' Since 
pHIG2 contains D segments only, the resultant pHIGS' plasmid contains a single V segment together with D segments 
The size of the insert contained in pHIGS is 10.6 kb plus the size of the V segment insert 

The insert from pHIG5 is excised by digestion with Notl and Spel and isolated. pHIG3' which contains J Cu and 
cy1 segments is digested with Spel and Notl and the 3' kb fragment containing such sequences and the rat 3' enhancer 
sequence is isolated. These two fragments are combined and Ogated into Notl digested pGP1 to produce pHIG which 
contains insert encoding a V segment, nine D segments, six functional J segments. Cu, Or and the rat 3" enhancer The 
size of this insert is approximately 43 kb plus the size of the V segment insert. 

I. Construction of Heavy Chain Mirtitocus bv Homologous Rpcombination 



As indicated in the previous section, the insert of pHIG is approximately 43 to 45 kb when a single V segment is 
employed. This insert size is at or near the limit of that which may be readily cloned into plasmid vectors In order to 
provide for the use of a greater number of V segments, the following describes ia yjya homologous recombination of 
overlapping DNA fragments which upon homologous recombination within a zygote or ES cell form a transgene con- 
taining the rat 3' enhancer sequence, the human Cu. the human Cy1 . human J segments, human D segments and a 
so multiplicrty of human V segments. 

«■ ,~ 6 '?J* BamHI/Hind,ll lament containing human J segments (see fragment (a) in Fig. 14) is doned into 
Mlul/Spel digested pHIG5' using the following adapters: ' 

25 



5' 


GAT 


CCA 


AGC 


AGT 


3 1 


5' 


CTA 


GAC 


TGC 


TTG 


3* 


5 f 


CGC 


GTC 


GAA 


CTA 


3* 



5' AGC TTA GTT CGA 3 



The resultant is plasmid designated pHIG5X> (overlap). The insert contained in this plasmid contains human V D 
and J segments. When the single V segment from pVH1 is used, the size of this insert is approximately 1 7 kb plus 2 kb 
This insert is isolated and combined with the insert from PHIG3' which contains the human J. Cu, y 1 and rat 3' enhancer 
n!?A U !!l CeS ' 8081 ,nSertS C0 " tain human J se 9 mente wh'ch provide for approximately 6.3 kb of overlap between the two 

^ments. When coinjected into the mouse zygote in yjya homologous recombination occurs generating a trans- 
gene equivalent to the insert contained in pHIG. 

TWsapproach provides for the addrtionof a multiplicity of Vsegmerte into the transgene forrnedjnyiyo^ Forexam- 
pie. '«tead of incorporating a single V segment into pHIG5\ a multiplicity of V segments contained on (1) isolated 
genomic DNA, (2) Ugated DNA derived from genomic DNA. or (3) DNA encoding a synthetic V segment repertoire is 

2X7° ? 81 ^ Sfi ! 81,6 10 96nerate PHIQ5 ' % The J 6eflmems fra 9 ment < a > 01 R 9- 14 is «™ doned into 
P T~ V n and 106 Insert IS0lated - insert now corrtains a multiplicity of V segments and J segments which overlap 
wrth the J segments contained on the insert isolated from pHIG3\ When cointroduced into the nucleus of a mouse 
zygote, homologous recombination occurs to generate in yjya the transgene encoding multiple V segments and multiple 
se 9 mem «. multiple D segments, the Cu region, the Cy1 region (all from human) and the rat 3' enhancer sequence. 

J. Constmction of Hftflyy Crmin Mini-locus by Ooinmction of Svnthatic vh Raginn Frao ^ nt Toi? ^ ar wH h H m y r rt , nin 
wo construct 

Synthetic V H region fragments are generated and isolated as previously described. These fragments are coinjected 

w J^ETi? 011 ,n8ert 01 pHIG (or 8 vereion 01 P HIG *** •« contain any V segments). The coin- 

jected DNA fragments are inserted into a single site in the chromosoma Some of the resulting transgenic animals will 
contain transgene inserts that have synthetic V regions located adjacent and upstream of the sequences in the pHIG 
construct These animals will have a larger human heavy chain primary repertoire than the animals described in Exam- 
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pie 5(H). 

EXAMPLE 6 
5 Construction o f I ^ Qhain Minilocus 
A. Construction of oEul 

The construction of pEn1 is depicted in Fig. 21 . The mouse heavy chain enhancer is isolated on the Xbal to EcoRI 
10 678 bp fragment (J. Banerji, et al. (1983), CfilL 2& 729-740) from phage clones using oligo: 

5' GAA TG'G GAG TGA GGC TCT CTC ATA CCC 
TAT TCA~ GAA CTG ACT 3 1 



15 



This En fragment is cloned into EcoRV/Xbal digested pGP1 by blunt end filling in EcoRI site. The resultant plasmid 
is designated pEmul. 

20 B. Constructio n Of * Light chain Minilocus 

The k construct contains at least one human V K segment, ail five human J K segments, the human J-C K enhancer, 
human k constant region exon, and, ideally, the human 3' k enhancer (K. Meyer, et al. (1989), EMB£U. &. 1 959-1 984). 
The k enhancer in mouse is 9 kb downstream from C K . However, it is as yet unidentified in the human. In addition, the 
25 construct contains a copy of the mouse heavy chain J-Cp, enhancers. 

The minilocus is constructed from four component fragments: 

(a) A 1 6 kb Smal fragment that contains the human C K exon and the 3' human enhancer by analogy with the mouse 
locus (fragment (a) in Fig. 20); 

30 (b) A 5' adjacent 5 kb Smal fragment, which contains all five J segments (fragment (o) in r-ig. zoj. 

(c) The mouse heavy chain intronic enhancer isolated from pE u 1 (this sequence is included to induce expression 
of the light chain construct as early as possible in B-cell development. Because the heavy chain genes are tran- 
scribed earlier than the light chain genes, this heavy chain enhancer is presumably active at an earlier stage than 
the intronic k enhancer); and 

35 (d) A fragment containing one or more V segments. 

The preparation of this construct is as follows. Human placental DNA is digested with Smal and fractionated on 
agarose gel by electrophoresis. Similarly, human placental DNA is digested with BamHI and fractionated by electro- 
phoresis. The 16 kb fraction is isolated from the Smal digested gel and the 1 1 kb region is similarly isolated from the 
40 gel containing DNA digested with BamHI. u Hi^ctoH 

The 16 kb Smal fraction is cloned into Lambda FIX II (Stratagene, La Jolla, California) which has been digested 
with Xhol, treated with Wenow fragment DNA polymerase to fill in the Xhol restriction digest product. Ligation of the 16 
kbSmal fraction destroys the Smal sites and lasesXhol sites in tact. 

The 1 1 kb BamHI fraction is cloned into X EMBL3 (Strategene, La Jolla, California) which is digested with BamHI 

45 prior to cloning. 

Clones from each library were probed with the Ck specific oligo: 

5» GAA CTG TGG CTG CAC CAT CTG TCT 
TCA TCT TCC CGC CAT CTG 3 ' 



50 



A 1 6 kb Xhol insert that was subcloned into the Xhol cut pE^1 so that Ck is adjacent to the Smal site. The resultant 
55 plasmid was designated pKapl. See Fig. 22. 

The above Ck specific oligonucleotide is used to probe the X EMBL3/BamHI library to identify an 1 1 kb clone cor- 
responding to fragment (d) of Fig. 20. A 5 kb Smal fragment (fragment (b) in Fig. 20) is subcloned and subsequently 
inserted into pKapl digested with Smal. Those plasmids containing the correct orientation of J segments, Ck and the 
En enhancer are designated pKap2. 
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One or more Vk segments are thereafter subdoned into the Mlul site of pKap2 to yield the plasmid pKapH which 
encodes the human Vk segments, the human Jk segments, the human Ck segments and the human Eji enhancer. This 
insert is excised by digesting pKapH with Notl and purified by agarose gel electrophoresis. The thus purified insert is 
microinjected into the pronucleus of a mouse zygote as previously described. 

C. Construction of k Light Chain Mini locus bv In Vivo Homologous Recombination 

The 1 1 kb BamHI fragment (fragment (d) in Fig. 20) is cloned into BamHI digested pGP1 such that the 3' end is 
toward the Sfil site. The resultant plasmid is designated pKAPint One or more Vk segments is inserted into the 
polylinker between the BamHI and Spel sites in pKAPint to form pKapHV The insert of pKapHV is excised by digestion 
with Notl and purified. The insert from pKap2 is excised by digestion with Notl and purified. Each of these fragments 
contain regions of homology in that the fragment from pKapHV contains a 5 kb sequence of DNA that include the J K 
segments which is substantially homologous to the 5 kb Smal fragment contained in the insert obtained f ran pKap2. As 
such, these inserts are capable of homologously recombining when microinjected into a mouse zygote to form a trans- 
gene encoding V K , J K and C*. 

P. Construction of K Light Chain Mini-Locus bv Cointection of Synthetic Vk Reoion Fragment Together with Lioht Chain 
JC Construct 

Synthetic Vk, region fragments are generated and isolated as previously described. These PNA fragments are 
coinjected with the purified Notl insert of plasmid pKap2 or plasmid pKapH. The coinjected PNA fragments are inserted 
into a single she in the chromosome. Some of the resulting transgenics will contain transgene inserts that have syn- 
thetic V regions located adjacent and upstream of the sequences in the pKap2 or pKapH construct These animals will 
have a larger human k light chain primary repertoire than those described in Example 6(B). 

EXAMPLE 7 

Isolation Of Genomic Clones Corresponding to Rearranged and Expres s ed Copies of Immunoglobulin k Lioht Chain 
Genes 

This example describes the cloning of immunoglobulin k light chain genes from cultured cells that express an 
immunoglobulin of interest. Such cells may contain multiple alleles of a given immunoglobulin gene. For example, a 
hybridoma might contain four copies of the k light chain gene, two copies from the fusion partner cell line and two copies 
from the original B-cell expressing the immunoglobulin of interest. Of these four copies, only one encodes the immu- 
noglobulin of interest despite the fact that several of them may be rearranged. The procedure described in this example 
allows for the selective cloning of the expressed copy of the k light chain. 

A. Pouble Stranded cPNA 

Cells from human hybridoma, or lymphoma, or other cell line that synthesizes either cell surface or secreted or both 
forms of IgM with a k light chain are used for the isolation of polyA+ RNA. The RNA is then used for the synthesis of 
oligc dT primed cPNA using the enzyme reverse transcriptase. The single stranded cPNA is then isolated and G resi- 
dues are added to the 3' end using the enzyme polynucleotide terminal transferase. The Gtailed single-stranded cPNA 
is then purified and used as template for second strand synthesis (catalyzed by the enzyme PNA polymerase) using 
the following oligonucleotide as a primer: 



5» - GAG GTA CAC TGA CAT ACT GGC ATG CCC 
CCC CCC CCC - 3 1 



The double stranded cPNA is isolated and used for determining the nucleotide sequence of the 5' end of the 
mRNAs encoding the heavy and light chains of the expressed immunoglobulin molecule. Genomic clones of these 
expressed genes are then isolated. The procedure for doning the expressed light chain gene is outlined in part B below. 

B. Light Chpin 

The double stranded cPNA described in part A is denatured and used as a template for a third round of PNA syn- 
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thesis using the following oligonucleotide primer: 

5« - GTA CGC CAT ATC AGC TGG ATG AAG TCA TCA GAT 
GGC GGG AAG ATG AAG ACA GAT GGT GCA - 3 



This primer contains sequences specific for the constant portion of the k light chain message (TCA TCA GAT GGC 
GGG AAG ATG AAG ACA GAT GGT GCA) as well as unique sequences that can be used as a primer for the PCR 
amplification of the newly synthesized DNA strand (GTA CGC CAT ATC AGC TGG ATG AAG). The sequence is ampli- 
fied by PCR using the following two oligonucleotide primers: 

5' - GAG GTA CAC TGA CAT ACT GGC ATG -3' 
5« - GTA CGC CAT ATC AGC TGG ATG AAG -3 1 



The PCR amplified sequence is then purified by gel electrophoresis and used as template for dideoxy sequencing 
20 reactions using the following oligonucleotide as a primer: 
5' - GAG GTA CAC TGA CAT ACT GGC ATG -3' 
The first 42 nucleotides of sequence will then be used to synthesize a unique probe for isolating the gene from 
which immunoglobulin message was transcribed. This synthetic 42 nucleotide segment of DNA will be referred to below 

as o-kappa. . . 

25 A Southern blot of DNA, isolated from the Ig expressing cell line and digested individually and in pairwise combi- 
nations with several different restriction endonucleases including Smal, is then probed with the 32-P labelled unique oli- 
gonucleotide o-kappa. A unique restriction endonuclease site is identified upstream of the rearranged V segment. 

DNA from the Ig expressing cell line is then cut with Smal and second enzyme (or BamHI or Kpnl if there is Smal 
site inside V segment). Any resulting non-blunted ends are treated with the enzyme T4 DNA polymerase to give blunt 

so ended DNA molecules. Then add restriction site encoding linkers (BamHI, EcoRI or Xhol depending on what site does 
not exist in fragment) and cut with the corresponding linker enzyme to give DNA fragments with BamHI, EcoRI or Xhol 
ends The DNA is then size fractionated by agarose gel electrophoresis, and the fraction including the DNA fragment 
covering the expressed V segment is cloned into lambda EMBL3 or Lambda FIX (Stratagene, La Jolla, California). V 
segment containing clones are isolated using the unique probe o-kappa. DNA is isolated from positive clones and sub- 

35 cloned into the polylinker of pKapl . The resulting clone is called pRKL 

EXAMPLE 8 

isolation of Genomic Clones Corresponding to Rearr a nged Expressed Copies of ImmunoqlQbulinq H^vy Chain u 



40 



This example describes the cloning of immunoglobulin heavy chain *i genes from cultured cells of expressed and 
immunoglobulin of interest. The procedure described in this example allows for the selective cloning of the expressed 
copy of a p. heavy chain gene. 

45 Double-stranded cDNA is prepared and isolated as described in part A of Example 7. The double-stranded cDNA 
is denatured and used as a template for a third round of DNA synthesis using the following oligonucleotide primer: 

5" - GTA CGC CAT ATC AGC TGG ATG AAG ACA GGA GAC 

GAG GGG GAA AAG GGT TGG GGC GGA TGC -3' 

so 



This primer contains sequences specific for the constant portion of the » heavy chain message (ACA GGA GAC 
GAG GGG GAA AAG GGT TGG GGC GGA TGC) as well as unique sequences that can be used as a primer for the 
PCR amplification of the newly synthesized DNA strand (GTA CGC CAT ATC AGC TGG ATG AAG). The sequence is 
amplif ied by PCR using the following two oligonucleotide primers: 5* - GAG GTA CAC TGA CAT ACT GGC ATG - 3' 
5' - GTA CTC CAT ATC AGC TGG ATG AAG - 3" 

The PCR amplified sequence is then purified by gel electrophoresis and used as template for dideoxy sequencing 
reactions using the following oligonucleotide as a primer: 
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5' - QAG QTA CAC TGA CAT ACT GGC ATG - 3* 
The *iret 42 nucleotides of sequence are then used to synthesize a unique probe for isolating the gene from which 
immunoglobulin message was transcribed. This synthetic 42 nucleotide segment of DNA will be referred to below as o- 

mu. 

A Southern blot of DNA, isolated from the Ig expressing cell fine and digested individually and hi pairwise combi- 
nations with several different restriction endonucleases including Mlul (Mlul is a rare cutting enzyme that cleaves 
between the J segment and mu CH1). is then probed with the 32-P labelled unique oligonucleotide o-mu A unique 
restnction endonuclease site is identified upstream of the rearranged V segment. 

DNA from the IG expressing cell fine is then out with Mlul and second enzyme. Mlul or Spel adapter linkers are then 
ligated onto the ends and cut to convert the upstream site to Mlul or Spel. The DNA is then size fractionated by agarose 
gel electrophoresis, and the fraction including the DNA fragment covering the expressed V segment is cloned directly 
into the ptasmid pGPI. V segment containing clones are isolated using the unique probe o-mu. and the insert is sub- 
cloned into Mlul or Mlul/Spel cut plasmid pCON2. The resulting plasmid is called pRMGH. 

EXAMPLE 9 

Peletipn of the Mouse Heavy Chain Ge ne bv Homologous Recombination 

This example describes the deletion of the endogenous mouse heavy chain gene by homologous recombination in 
embryonic stem (ES) cells fflitetra. et al. (1989). Nature. 342, 435-438) followed by the transplantation of those ES cells 
into a mouse blastocyst embryo such that the ES cells colonize the germline of the resultant chimeric mouse (Terato- 
caranomas and embryonic stem cells: a practical approach. E.J. Robertson, ed, IRL press. Washington DC 1987) 

The construction of a DNA sequence that win homdogously recombine into the mouse chromosome so as to 
delete the heavy chain J segments, thus eliminating the possibility of successful gene rearrangement at the heavy chain 
locus. The design of this construct is outlined below. 

^Plasmid pGPI is digested with the restriction endonucleases BamHI and Bglll and religated to form the plasmid 
pGP1d1 . This plasmid is then used to build the so-called gene knockout construct 

To obtain sequences homologous to the desired target region of the mouse genome, mouse genomic clones are 
isolated from a phage Itorary derived from non-lymphoid tissue (such as liver) using the J H specific oligonucleotide 
probe: 

5» - GGT CTA TGA TAG TGT GAC TAC TTT GAC TAC TGG 
GGC CAA GGC - 3* 



A 3.5 to Kpnl to EcoRI fragment that hybridizes with this probe is isolated from DNA derived from positive phage 
clones. This fragment is subcloned into Kpnl/EcoRI digested pGPIdl to form the plasmid pMKOl. 

Neomycin resistance (Neo) and Herpes Simplex Virus thymidine k inase (TK) genes for drug selection of recom- 
binants (M. Capecchi (1989), Science , 244, 1288-1292) are then isolated as follows. The plasmid pGEM7(KJ1) (M A 
RudnicW, 3/1 5/89) is digested with Hindlll and the ends blunted with the Wenow form of DNA pol I. The DNA is then out 
with EcoRI and the pGKNeo fragment is isolated and cloned into Sphi/Nael cut pMKOl using the following oligonucle- 
otide as an adapter: 

5* - AATTCATG -3' 

The resulting plasmid is designated pMK02. This plasmid contains the neomycin resistance gene flanked by 
sequences that flank the mouse J H segments. This plasmid alone can be used for deletion of the heavy chain gene 
Alternatively the Herpes TK gene can be added to the construct to improve the frequency of homologous recombination 
n™^" 1 d ° neS (M ' Capecchi < 1989 >' Sc^Ce. 244 1288-1292). This is done as follows. The EcoRI to 
Hindlll PGKTK fragment of pGEM7(TK) {MA RudnicW) is isolated and cloned into the Kpnl site of pMK02 using the 
following oligonucleotide as adapters: 

5' - AATTGTAC - 3 1 
5' - AGCTGTAC - 3' 

The resulting plasmid is designated pMK03. 

To further improve the overall efficiency of homologous recombination, a large segment of DNA that is homologous 
to the target sequence is then added to the construct A 13 Kb EcoRI fragment, that hybridizes with the Cn specific oii- 
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gonucleotide described below: 

5* - GCA TCC TGG AAG GTT CAG ATG AAT ACC 
TTG TAT GCA AAA TCC - 3' 



This 12 Kb fragment includes the Cn coding exons, or a substantial portion of that fragment which includes the 5* 
EcoRI end, s isolated from a mouse genomic phage library and subcloned into the EcoRI site of pMK03. The resultant 

PlSS ^i^ by digestion with Notl and electroporated into ES cells. Homologous ^o^inant 

clones are isolated used to generate a J H deleted mouse as described by Zjilstra. et al. (1989), UsSm. 242. 435-438. 
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EXAMPLE 1Q 

p^ir^ nf thft M™** Lio h t nhain Gene bv Homologous Recombination 

This example describes the deletion of the endogenous mouse light chain gene by homologous recombination in 
embryonic stem cells (see previous Example). „ rtnetan * 
A DNA sequence that homologously recombines into the mouse chromosome to delete the k light chain constant 
region exon is constructed. The design of this construct is outlined below. ^ ... 

A 2 kb BamHI I to EcoRI thymidine kinase fragment from pGEM7(TK)Sal (MA Rudructa, Whitehead Institute) is iso- 
lated and subcloned into the BamHI/Sfil digested pGP1 using the following oligonucleotide adapter: 
5' - AATTTTG - 3' 
25 The resulting plasmid is designated pKKOI . 

To obtain sequences homologous to the desired target region of the mouse genome, mouse genomic clones are 
isolated from a phage library derived from non-lymphoid tissue (such as liver) using the mouse k light chain specific 
oligo designated o-MKC given below: 
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5' - GGC TGA TGC TGC ACC AAC TGT ATC CAT 
CTT CCC ACC ATC CAG - 3 1 



DNA is isolated from positive clone and a 2.3 kb Bglll fragment (RS. Neumaier and H.G. Zachau (1 983), NycL 
Acids Res „ ri, 3631 -3656) that hybridizes with probe 0-MK3 is isolated. The sequence of probe 0-MK3 is as follows: 

5* - CAT TCT GGG TAT GAA GAG CCC ACG TAT 
CAA AGG TTA CAT TAG - 3 1 

This 2.3 kb Bglll fragment is subcloned into BamHI digested pKK01 such that the 3' end of the fragment is adjacent 
to the polyiinker Sf il site. The resulting plasmid is designated pKK02. _ i4iwa „ tofltt 

The 4 kb Sphl to Hpal DNA fragment that hybridizes with oligonucleotide o-MKC is isolated from positive phage 
clone and subcloned into EcoRV to Sphl digested plasmid pKK02. The resulting plasmid is designated pKK03. 

A 2 kb Sail to EcoRI fragment of pGEM7(KJ1)Sal (M.A. Rudnicki, 3/15/89) is isolated and cloned into the BssHII 
site of plasmid pKK03 using linker adapters. This is carried out by first ligating a mixture of the follov/ing three oligonu- 
cleotides to the 2 kb Sail to EcoRI fragment: 

5' - CAGCGCGC - 3* 
5' - GATCGCGCGCTG - 3 
5 1 - AATTGCGCGCTG - 3' 



55 



The ligation mixture is then digested with the enzyme BssHII and ligated to BssHII digested plasmid pKK03. The 
resulting plasmid is designated pKK04. 
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The insert of pKK04 is isolated by digesting with Noll and electroporated into ES cells. Homologous recombinant 
clones are isolated and used to generate a deleted mouse as described by Zjilstra. et al. (1989). Nature. 342, 435- 
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om K™ Se T 1pl ^f Cr ,! b ^„ the inac,ivalion * •» ™ u ™ endogenous kappa locus by homologous recombination in 
embryonic stem (ES) cells followed by introduction of the mutated gene into the mouse germ line by injection of targeted 
ES cells bearing an inactivated kappa allele into early mouse embryos (blastocysts). 

The strategy is to delete J K and C K by homologous recombination with a vector containing DNA sequences homol- 
° 9 °"l t0 *• mouse taPP 8 locus * which a 4.5 kb segment of the locus, spanning the J K gene and C K segments is 
deleted and replaced by the selectable marker neo. K ^ 

Construction of the kappg targeting vertnr 

The plasmid pGEM7 (KJ1) (M. A. Rudnicki. Whitehead Institute) contains the neomycin resistance gene (neo) 
used for drug selection i of transfected ES cells, under the transcriptional control of the mouse phosphoglycerate kinase 
(pgk) promoter (Xbal/I/Taql fragment; Adra. C.N. et al.. (1987) Seng, 60. 65-74) in the cloning vector pGEM-72ff». The 
ptasmid also includes a heterologous polyadenylation site for the neo gene, derived from the 3' region of the mouse pgk 
gene (Pvull/Hindlll fragment; Boer. P.H.. et al.. (1990) Biochemical Gftnfttire 28. 299-308). This plasmid was usedas 
the starting point for construction of the kappa targeting vector. The first step was to insert sequences homologous to 
the kappa locus 3' of the neo expression cassette. 

r Mou ^ , ^ a ^ ain sequences (Fig. 25a) were isolated from a genomic phage library derived from liver DNA using 
oligonucleotide probes specific for the Ck locus: 

5«- GGC TGA TGC TGC ACC AAC TGT ATC CAT CTT CCC ACC ATC CAG 
-3' 

and for the Jk5 gene segment: 

5'- CTC ACG TTC GGT GCT GGG ACC AAG CTG GAG CTG AAA CGT AAG - 
3« . 



40 



45 



50 



55 



An 8 kb Bglll/Sacl fragment extending 3' of the mouse C K segment was isolated from a positive phage clone in two 
pieces as a 1.2 kb Bglll/Sacl fragment and a 6.8 kb Sad fragment, and subcloned into Bglll/Sacl digested pGEM7 
(KJ1) to generate the plasmid pNEO-K3' (Fig. 25b). 

sJ^iltSSS**! f ^ ment f 6 ^" 9 5 * * e Jk re9i0n W3S 3,50 isolated from a P 08 ^ 6 clone, An 
adaf *° r l,9ated to the SP" 1 ^ * thi 5 'ragment. and the resulting EcoRI fragment was 
hgated into EcoRI digested pNEO-KS', in the same 5" to 3' orientation as the neo gene and the downstream 3' kappa 
sequences, to generate pNEO-KS'S' (Fig. 25c). 

TheHerpesSimplex Virus (HSV) thymidine kinase (TK) gene was then included in the construct in order to allow 
T^t? 0 " 88 bearin9 homologous recombinants, as described by Mansour et al. ((1988) Nature. 336, 
348-352). The HSV TK cassette was obtained from the plasmid pGEM7 (TK) (MA RudnicW). which contains the struc- 
h l 9 ^!™ 65 for tte HSV TK gene brackBted *• m0U8e P9 k Promoter and polyadenylation sequences as 
described above for pGEM7 (KJ1). The EcoRI site of pQEM7 (TK) was modified to a BamHI site and the TK cassette 
was then excised as a BamHI/Hindlll fragment and subcloned into pGPlb to generate pGP1b-TK. This plasmid was 
linearized at the Xhol site and the Xhol fragment from pNEO-KS^. containing the neo gene flanked by genomic 
sequences from 5' of Jk and 3' of Ck. was inserted into pGP1b-TK to generate the targeting vector J/C Kl (Fig 25d) 
The putative structure of the genomic kappa locus following homologous recombination with J/C K1 is shown in Fta 
Zoe. B ' 
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feneration and analysis of ES ftfllls with targe ted inactivation of a kappa alWQ 

AB-1 ES cells were grown on mitotically inactive SNL76/7 cell feeder layers (McMahon, A. P. and BradleyA (1990) 
Cell, £2, 1073-1085) essentially as described (Robertson, EJ. (1987) in TfirMtooarcinQma^ nnri FnrthrYonic Stem Cells ; 
a Practical Arjoroach E J Robertson, ed. (Oxford: IRL Press), p. 71-1 12). 

A S^gS inalation vector jfC K1 was digested wHh Not. and electroporated into AB-1 cells by the meth- 
ods described (Hasty, P.R.. et al. (1991) Nature, 3J& 243-246). Electroporated cells were plated onto 100™ dishes 
at a density of 2-5 x 1 0 6 cells/dish. After 24 hours, G418 (200ng/ml of active component) and FIAU (0.5nM) were added 
to the medium, and drug-resistant clones were allowed to develop over 1 0-1 1 days. Clones were picked, grazed, 
divided into two portions, and further expanded. Half of the cells derived from each clone were then frozen and the other 
half analyzed for homologous recombination between vector and target sequences. 

DNA analysis was carried out by Southern Wot hybridization. DNA was isolated from the clones as described 
(Laird PW. et al.. (1991) Nud. Acids Res.. 12.) digested with Xbal and probed with the 800 bp EcoRI/Xbal fragment 
indicated in Fig. 25e as the diagnostic probe. This probe detects a 3.7 Kb Xbal fragment in the wild type locus, and a 
diagnostic 1 .8 Kb band in a locus which has homologously recombined with the targeting vector (see Fig. 25aand e). 
Of 358 Q418 and FIAU resistant clones screened by Southern Wot analysis. 4 displayed the 1 .8 Kb Xbal band .ndicatrve 
of a homologous recombination at the kappa locus. These 4 clones were further digested with the enzymes Bglll, Sacl. 
and PstI to verify that the vector integrated homologously into one of the kappa alleles. When probed wrth ^diagnostic 
800 bp EcoRI/Xbal fragment. Bglll, Sacl, and PstI digests of wild type DNA produce fragments of 4.1 5.4, and 7 to, 
respectively, whereas the presence of a targeted kappa allele would be indicated by fragments of ^5 ; and 57 to 
resp ectively (see Fig. 25a and e). All 4 positive clones detected by the Xbal digest showed the expected Bglll. Sacl. and 
PstI restriction fragments diagnostic of a homologous recomWnation at the kappa light chain. 

Generation of mice bearing the inactivated kaooa chain 

The 4 targeted ES clones described in the previous section were injected into C57B1/6J blastocysts as described 
(Bradley. A (1987) in Taratocarcinomas anri Embryonic Stem Cells: A Practica l Approach. E.J. Robertson, ed. (Oxford: 
IRL Press) p. 1 13-151) and transferred into the uteri of pseudopregnant females to generate chimenc mice present- 
ing a mixture of cells derived from the input ES cells and the host blastocyst. Chimeric animals are visually identfied by 
the presence of agouti coat coloration, derived from the ES cell line, on the Wack C57B1/6J background. The AB1 ES 
cells are an XY cell line, thus male chimeras are bred with C57BU6J females and the offspring monrtored for the pres- 
ence of the dominant agouti ooat color. Agouti offspring are indicative of germline transmission of the ES B^J™ 
heterozygosity of agouti offspring for the kappa chain inactivation is verified by Southern Wot analysis ol DNA from tail 
biopsies using the diagnostic probe utilized in identifying targeted ES clones. Brother-sister matings of heterozygotes 
are then carried out to generate mice homozygous for the kappa chain mutation. 

EXAMPLE 12 

Inactivation of *h a Mmse Heavy Chain Gene PV Homolonous Recombination 

This example describes the inactivation of the endogenous murine immunoglobulin heavy chain locus by homolo- 
gous recomWnation in embryonic stem (ES) cells. The strategy is to delete the endogenous heavy chain J segments 
by homologous recomWnation with a vector containing heavy chain sequences from which the J H region has been 
deleted and replaced by the gene for the selectaWe marker neo. 

Constructio n of a heavy chain targeting vector 

Mouse heavy chain sequences containing the J H region (Fig. 26a) were isolated from a genomic phage library 
derived from the D3 ES cell line (Gossler. et al.. (1986) Proc Natl. Acad- Sci. U.S.A.. §3, 9065-9069) using a J H 4 spe- 
cific oligonucleotide probe: 

5'- ACT ATG CTA TGG ACT ACT GGG GTC AAG GAA CCT CAG TCA CCG -3' 

A 3 5 kb genomic Sacl/Stul fragment, spanning the J H region, was isolated from a positive phage clone and sub- 
cloned into Sacl/Smal digested puc18. The resulting plasmid was designated puc18 I J„. Thi | "W"^^^ 
(neo) used for drug selection of transferred ES cells, was derived from the plasmid pGEM7 (KJ1). The Hindlll site in 
r>GEM7 (KJ1) was converted to a Sail site by addition of a synthetic adaptor, and the neo expression cassette excised 
by digestion with Xbal/Sall. The ends of the neo fragment were then blunted by treatment with the Wenow form of DNA 
poll, and the neo fragment was subcloned into the Nael site of puc18 J H . generating the plasmid puc18 J H -neo (Fig. 

^Further construction of the targeting vector was carried out in a derivative of the plasmid pGP1b. pGP1b was 
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digested with the restriction enzyme Notl and ligated with the following oligonucleotide as an adaptor: 

5'- GGC CGC TCG ACG ATA GCC TCG AGG CTA TAA ATC TAG AAG AAT TCC 
AGC AAA GCT TTG GC -3 ' 



The resulting plasmid, called pGMT, was used to build the mouse immunoglobulin heavy chain targeting construct 
The Herpes Simplex Virus (HSV) thymidine kinase (TK) gene was included in the construct in order to allow for 
enncnmerrt of ES clones bearing homologous recombinants, as described by Mansour et al. ((1988) Nature 356. 348- 
352). The HSV TK gene was obtained from the plasmid pGEM7 (TK) by digestion with EcoRI and Hindlll The TK DMA 
fragment was subcloned between the EcoRI and Hindlll sites of pGMT, creating the plasmid pGMT-TK (Rg 26c) 
e . p ™ dean extensive region of homology to the target sequence, a 5.9 Kb genomic Xbal/Xhol fragment, situated 
5 ofthe J H region, was derived from a positive genomic phage done by limit digestion of the DNA with Xhol. and partial 
digestion with Xbal. As noted in Rg. 26a and 26b. this Xbal site is not present in genomic DNA, but is rather derived 
from phage sequences immediately flanking the cloned genomic heavy chain insert in the positive phage clone The 
fragment was subcloned into Xbal/Xhol digested pGMT-TK. to generate the plasmid pOMT-TK-JhS 1 (Rg 26d) 

The final step in the construction involved the excision of the 3 Kb EcoRI fragment from puc18 J H -neo which con- 
tained Ihe neo gene and flanking genomic sequences. This fragment was blunted by Wenow polymerase and sub- 
cloned into the similarly blunted Xhol site of pGMT-TK-J H 5\ The resulting construct. J H K01 (Rg. 26e). contains 6 9 Kb 
of genomic sequences flanking the J H locus, with a 2.3 kb deletion spanning the J„ region into which has been inserted 
the neo gene. Rg. 25f shows the structure of an endogenous heavy chain allele after homologous recombination with 
the targeting construct. 

EXAMPLE 13 

Generation and analysis of targeted ES calls 

SNL mSZ!LT*^ i- P n and !! radleyA (1990) £SL£2 ' 1073 " 1 °85) were grown on mitotically inactive 
r^Tp 1 f f layers essentially as described (Robertson. E.J. (1987) Teratocarcinomas «nd Emhrvnnir fit,™ 

Cells; A Practical Appfoaoh. E.J. Robertson, ed. (Oxford: IRL Press), pp. 71-1 1^ 

The heavy chain inactivation vector J H K01 was digested with Notl and electroporated into AB-1 cells by the meth- 
f? T"5?J^ RR - * aL ( 1991 )NaMg3§&. 243-246). Electroporated cells were plated into 100 mm dishes at 
a density* 2-5 x 10 6 cells/dish. After 24 hours. G418 (200mg/ml of active component) and FIAU (0.5mM) were added 
to the medium, and drug-resistant dones were allowed to develop over 8-10 days. Clones were picked, trypsinized 
divided '"to two portions, and further expanded. Half of the cells derived from each clone were then frozen and the other 
naif analyzed for homologous recombination between vector and target sequences 

dui a ^!? !? Carri8d 0Ut *» Southern blot WMvaon. DNA is isolated from the clones as described (Laird 
P.W et al (1991) Nud. Acids Res. , 12.) digested with Hindlll and probed with the 500 bp EcoRI/StuI fragment desig- 
« £1 * , !5 ,ag " ostic P robe in R 9- 261 This Prabe detects a Hindlll fragment of 2.3 kb in the wild type locus whereas 
55 ^r^^f 0 * 8 tar06ted ,0CUS has nomol °3ously recombined with the targeting vedor (see Rg. 
26a and f). Additional digests with the enzymes Spel, Stul. and BamHI are carried out to verify the targeted disruption 
of the heavy chain allele. 

EXAMPLE 14 

Heavy Chain Miniloajs Trarffig ftnft 

A. QpngfructiQn Of Plasmid vectors for clo n ing laroe DNA sequences 
l.flQElfl 

The plasmid pBR322 was digested with EcoRI and Styl and ligated with the following oligonucleotides: 
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oligo-42 5-- caa gag ccc gcc taa tga gcg ggc ttt ttt ttg cat 
act gcg gcc get -3 * 

oligo-43 5'- aat tag egg ccg cag tat gca aaa aaa age ccq etc 
att agg egg get -3' 



10 



The resulting plasmid. pGPla, is designed for cloning very large DN A constructs that can be excised by the rare 
cutting restriction enzyme Not., .t contains a Notl restriction site downstream (relative to the ^J'^ 
is AmpR) of a strong transcription termination signal derived from the trpA gene (Chr.stie, G.E. et al. (1981V ^JM, 
Ar ari m. USA. 78. 4180). This termination signal reduces the potential toxicity of coding sequences inserted mto the 
^TlyMrdno readthrough transcription from the AmpR gene. In addition, this plasmid is low copy re lata* to 
the pUC plasmids because it retains the pBR322 copy number control region. The low copy number further reduces the 
potential toxicity of insert sequences and reduces the selection against large inserts due to DNA repl.cat.on. 

20 

2. pGP1b 

pGPla was digested with Notl and ligated with the following oligonuoleotides: 
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oligo-47 5'- ggc cgc aag ctt act get gga tec tta att aat cga 
tag tga tct cga ggc -3 f 

oligo-48 5'- ggc cgc etc gag ate act ate gat taa tta agg ate 
cao cag taa get tgc -3' 



The resulting plasmid, pGP1b. contains a short polylinker region flanked by Notl sites. This facilitates the construc- 
tion of large inserts that can be excised by Notl digestion. 

40 3. pGPe 

The following oligonucleotides: 

oligo-44 5'- etc cag gat cca gat ate agt acc tga aac agg get 

45 

tgc -3' 

oligo-45 5'- ccc gag cat gca cag gac ctg gag cac aca cag cct 

tec -3' 



were used to amplify the immunoglobulin heavy chain 3' enhancer (S. Petterson, et al. (1990) Nature. 344. 165-168) 
55 from rat liver DNA by the polymerase chain reaction technique. , nWM rw 
The amplified product was digested with BamHI and Sphl and cloned into BamHI/Sphl digested pNN03 feNNOB 
is a pUC derived plasmid that contains a polylinker with the following restriction sites, listed in order: Notl. BamHI Ncol. 
Clal. EcoRV. Xbal. Sacl. Xhol. Sphl. Pstl. Bglll. EcoRI. Smal. Kjpnl. Hindlll. and Notl). The resulting ,ptasm4 Pj^JJ 
digested with BamHI and Hindlll. and the insert containing the rat Ig heavy chain 3' enhancer cloned into BamHI/Hindlll 
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digested pGP1b : The resulting plasrrid. pGPe (Hg. 27 and Table 1). contains several unique restriction sites into which 
sequences can be cloned and subsequently excised together with the 3' enhancer by Noil dlgSET 



A^KcgcccccrcgagatcaCTatcsattaattaaggatccaoaca-Motacc-'-aaa.. 

^ccccaccctgcaactgca^t^ 



AGTTOTlACCCT^OAa^^ 

CMCGSXATCATTACCCCttTGAACAG^^ 
TAACATX^CXCCTT^Tt*^ 

-SCSGCATCAGAGCAGATTGTACTGAC^G-S- ^^-^^Sz25f?' lCk * ^^^^^^TACTGGCTTAAC--. 
=CA7CAr^CCTCTTCcS~^ 

^GG^CCCTAAAAAGGCCGCG^^^ 
"™ ^Ir^S^-^^^Ir^AA^^CGCT 




Table l Sequence of vector pGPe. 



B- Construction Of loM expressing miniin^ t ransoftna plPiM^ 
1 . Isolation of J. u constant renion clones pnri instruction nf mmi 

he ltT P Sf nta ' fleno ™ DNA . ltt5 rary cloned into the phage vector XEMBL3/SP6/T7 (aonetech laboratories 
Inc.. Palo Alto. CA) was screened with the human heavy chain J region specific oligonudeoMe: ' 



35 



EP 0 814 159 A2 



oligo-1 5'- gga ctg tgt ccc tgt gtg atg ctt ttg atg tct ggg 

gcc aag -3 1 

and the phage clone M.3 isolated. A 6 Kb Hindlll/Kpnl fragment from this clone, containing all six J "V"^ ""J* 
asbse^memDHQszandthehe^ 

hurran ji specific oligonucleotide: 

10 oligo-2 5'- cac caa gtt gac ctg cct ggt cac aga cct gac cac 
eta tga -3' 

»nd the nheae clone Ji2 1 isolated. A 10.5 kb Hindlll/Xhol fragment, containing the n switch region and all of the n con- 
15 this clone. Tnese two fragments were ligated together with Kpn^Xho. d,gested 

pNN03 to obtain the plasmid pJM1 . 
2. EsJME 

A4kbXhol fragment was isolated from phage clone X2.1 that cwtains sequences immediately d^ 
chances in nJMI including the so called En element involved in u deletion in certain IgD expressing B-cells {H. Yasu. 

fl'l Y hn Senow freated pJM1 The resulting plasmid. pJM2 (Fig. 28), had lost the internal Xhol site but 
2£ f25 & sledT^lele r^ ton by th^Klenow enzyme. pJM2 contains the entire humar , J reg,on 
STKSrSin J-n intronic enhancer, the u swrtch region and ail of the, consent reg.on axons, as well as the two 0.4 
kb direct repeats, oji and lu, involved in u deletion. 

a Isolation of n ranion clor| f« ""d construction of PPH1 

The following human D region specific oligonucleotide: 

oligo-4 5'- tgg tat tac tat ggt teg ggg agt tat tat aac cac 
agt gtc -3' 

was used to screen the human placenta genomic library for ^^J^^^^^^^ 
A 5 5 kb Xhol fragment, that includes the D elements D K1 , D N1 , and D M2 (Y. Ichihara et al (1988) EM3QJ, ^ «4i ). 
was feSated from Phage clone M.I . An adjacent upstream 5.2 kb Xhol fragment, that includes the D elements iD^. 
was isotated from J x4 3 Eacn « tee D reg ion Xhol fragments were cloned into 

55ai* plaSTeSoTpSPT^ (Promega. Madison. W 0 so as to desfroy the Xhol 
sMu^es The upsEeam fragment was then excised with Xhol and Smal. and the downstream fragment wrth EcoRV 
aS Zl Th^CisolatL fragments were ligated together wrth Sail digested " 
pDH1 contains a 10.6 kb insert that includes at least 7 D segments and can be excised w,th Xhol (50 and EcoRV (3"). 

4. p£QBl 

Th 0 «i aC miH n.iM2 was diaested with Asp71 8 (an isoschizomer of Kpnl) and the overhang filled in with the Klenow 
tragm^^ 

ligated to the Xhol/EcoRV insert of pDH1 and Xhol/Clal digested pGPe to generate pCOR1 (Fig. 29). 

5. pVH251 

A 10.3 kb genomic Hindlll fragment containing the two human heavy chain variable "JjWJSSf 1 "* 
V H 105 (C.G. Humphries et al. (1 988) Nature 231 446) was subcloned into pSP72 to gwe the plasmri pVH251 . 
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6. plGM1 

The plasmid pCOR1 was partially digested with Xhol and the isolated XhoUSall insert of pVH251 cloned into the 
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10 



upstream Xhol site to generate the plasmid plQM1 (Fig. 30). pIGMl contains 2 functional human variable region seg- 
ments, at least 8 human D segments all 6 human J H segments, the human J-u enhancer, the human cu element the 
human u switch region, all of the human u coding exons. and the human I* element together with the rat heavy chain 
3 enhancer, such that all of these sequence elements can be isolated on a single fragment, away from vector 
sequences, by digestion with Noll and microinjected into mouse embryo pronuclei to generate transgenic animate. 

C. Construction of IqM and loG expressing minilocus transoene. dHC1 
1 . Isolation of v conse nt region clones 

The following oligonucleotide, specific for human ig g constant region genes: 



oligo-29 5'- cag cag gtg cac acc caa tgc cca tga gcc cag aca 
is ctg gac -3 ' 



was used to screen the human genomic Ibrary. Phage clones 129.4 and A29.5 were isolated. A 4 kb Hindill fragment 
ofphage clone X29.4, containing ay switch region, was used to probe a human placenta genomic DNA library cloned 
.ntothe phage vector lambda FIX™ II (Stratagene. la Jolla. CA). Phage done XSg1 .13 was isolated. To determine the 
subclass of the different y clones, dideoxy sequencing reactions were carried out using subclones of each of the three 
phage clones as templates and the following oligonucleotide as a primer: 
oligo-67 5- tga gcc cag aca ctg gac : 3' 

Phage clones X29.5 and *Sy1 .13 were both determined to be of the y1 subclass. 

2. Pyel 

A 7.8 kb Hindill fragment of phage clone X29.5. containing the T 1 coding region was cloned into pUC18. The result- 
ing plasmd, pLT1. was digested with Xhol. Wenow treated, and religated to destroy the internal Xhol site. The resulting 
clone. pLTIxk, was digested with Hindill and the insert isolated and cloned into pSP72 to generate the plasrhid clone 
pLTIxks. Digestion of pLTIxks at a polylinker Xhol site and a human sequence derived BamHI site generates a 7 6 kb 
fragment containing the Y 1 constant region coding exons. This 7.6 kb Xhol/BamHI fragment was cloned together with 
an adjacent downstream 4.5 kb BamHI fragment from phage done X29.5 into Xhol/BamHI digested pGPe to generate 
the plasmid done pyel. pyel contains all of the T 1 constant region coding exons. together with 5 kb of downstream 
35 sequences, linked to the rat heavy chain 3' enhancer. 

3. Pye2 
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A 5.3 kb Hindill fragment containing the r 1 switch region and the first exon of the pre-switch sterile transcript (P. 
Sideras et al. (1989) International Immunol 1 631) was isolated from phage clone XS/1.13 and cloned into pSP72 with 
the polylinker Xhol site adjacent to the 5' end of the insert, to generate the plasmid done pSyl s. The Xhol/Sall insert of 
pSrls was doned into Xhol digested pyel to generate the plasmid done pye2 (Fig. 31). pye2 contains all of the y1 con- 
stant region coding exons. and the upstream switch region and sterile transcript exons. together with 5 kb of down- 
stream sequences, linked to the rat heavy chain 3' enhancer. This done contains a unique Xbol site at the 5' end of the 
insert The entire insert, together with the Xhol site and the 3* rat enhancer can be excised from vector sequences by 
digestion with Notl. 



4. DHC1 



The plasmid pIGMl was digested with Xhol and the 43 kb insert isolated and doned into Xhol digested pge2 to 
generate the plasmid pHCl (Fig. 30). pHCl contains 2 functional human variable region segments, at least 8 human D 
segments all 6 human J H segments, the human J-^ enhancer, the human op. element the human ji switch region all 
of the human u coding exons. the human Xu element and the human y 1 constant region, inducing the associated 
switch region and sterile transcript associated exons, together with the rat heavy chain 3' enhancer such that all of 
these sequence elements can be isdated on a single fragment, away from vector sequences, by digestion with Notl and 
microinjected into mouse embryo pronudei to generate transgenic animals. 
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D. Construction of IqM and laG expressin g minilocus transaene. PHC2 
1 . Isolation of human heavy chain V region gene VH49.8 

The human placental genomic DNA library lambda, FIX™ II, Stratagene, La Jolla, CA) was screened with the 
lowing human VH1 family specific oligonucleotide: 

oligo-49 5 1 - gtt aaa gag gat ttt att cac ccc tgt gtc etc tec 
aca ggt gtc -3 1 



Phage clone 7,49.8 was isolated and a 6.1 kb. Xbal fragment containing the variable segment VH49.8 subcloned 
into pNN03 (such that the polylinker Clal site is downstream of VH49.8 and the polylinker Xhol site is upstream) to gen- 
is erate the plasmid pVH49.8. An 800 bp region of this insert was sequenced, and VH49.8 found to have an open reading 
frame and intact splicing and recombination signals, thus indicating that the gene is functional (Table 2). 
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TTCCTCAGGC AGGATITAGG GC.TGSTCTC TC7_3CATCCC ACA L ' I.WZ- r 
aGCrGATGTG GCATCTGTGT TTTCITTCrC ATCC2AGA3C JffiGCTTTES 
CIGIGAAAIA CCCTGCCTCA TGAAIATGCA AAIEATCTGA GS-CT-CTS-. 
GAIAAAIAIA (aSKKTTGG TGCCCSGAGA ACAACCAGAT 



~-yva___- 2er-r_r.ya.L3 lr_je_Va_G- 



200 



TCCICCTCIA AASAGCCXT TGSGAGCACA (2XMO_X ATCGACISGA 25C 

CCTGGAG3TT CCTCTTTGTG GTGGCAGCAG CTACAGotaa ogaqcttccr 300 
hrTrpArgPh eLeuPheVal ValAlaAlaA laThr 

sgrccraagg crgaggaagg catccrggrt ragczaaaca cgar~ra_- _5C 



o^.^. oraj^ A v_.-.r_=-. o^.ox.w C7C3GTGAAG GTCTCCTGC.-. -.11 
nSerGiyAia GIuVaiLvsl vsProGlvSe rSen/allvs ValSe*C-«=L 
AGGCHOGG AGGCACCTTC AGCAGCIATG GCTCTCSS 50C 

ysAlaSerGI yGlvThrPhe SerSerTvrA LalleSe^"- pVaJAr^"'' - 
GOOCCTGGAC AAGGGCTTGA GTOGMGGGA AGGATCATCC CTAIXrKSc- 550 
AlaProGlyG lnSLyLeuGl uTrpMetGlv ArcIIelleP roIleLeuGI 
TAIRQCAAAC TMGGACAGA AGTTCCAGGG CAGAGTCACG AITACCGCG3 600 
ylleAlaAsn TyrAlaGlnL ysPheGlnGl vAroVaiThr IleThrAlaA 
ACAAATOCAC GAGT_^GCC IACATGGAGC rGfiGC?-5CCr 3AGASC.GAG £5C 
spLysSerTh rSerThrAla tyxMetGluL euSerSerLe uArgSerGlu 
GACACG30CG TGIATTACTG TQC5ftGK^CS2_i^ AACCCACA7C ~C: 
AspThrAl aV alTvrTvrC y sAlaArc 

OGAGfiGIT_IJ2i3_i^^ 750 
AGA1GACAGG GTTIATIAGG TTTAAGGCIG TTTACAAAAT GGGTTA1ATA 800 
mGAGAAAA AA 812 



Table 2 



Sequence olhuman V H I family gene V H 49.8 



pV2 

A4 kbXbal genomic fragment containing the human V H IV family gene V„4-21 (I. Sanz et al. (1989) EM BO J g. 
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37411 subcloned into the plasmid pUC12. was excised with Smal and Hindlll. and treated with the Klenowfragmerrt of 
wSe^rCiuS ended fragment was then cioned into C.a. digested, Wenow treated. ^WM^ nj-Ang 
pTasTd PW. Stains the human heavy chain gene VH49.8 linKed upstream of VH4-21 in the same onentaton. w.th 
a unique Sail site at the 3' end of the insert and a unique Xhol site at the 5' end. 

3- pSti -y 

A 0.7 kb Xbal/Hindlll fragment (representing sequences immediately upstream of, ^jfj™? ^^taamert " 
switch region containing fragment in the plasmid pre2) together with the ne.ghbor.ng «6tainS.l kb Xba I fragment 

feLs prior to switching to the y 1 isotype (P. Sideras et al. (1989) IntfrpatlQnal Immunol, . 1 . 631^ Because .the t an 
sc St is implicated in the initiation of isotype switching, and upstream cis-act.ng seque "ces are ofter fo ;£ n 
scription regulation, these sequences are included in transgene constructs to promote correct express.on of the stenle 
is transcript and the associated switch recombination. 

4. pVGE1 

The pSy1-5' insert was excised with Smal and Hindlll. treated with Klenow enzyme, and ligated with the following 
so oligonucleotide linker: 

TheTgSon 9 ^ digested with Sail and ligated to Sal. digested pV2. The resulting plasm*. pVP. contains 3* 
£ JT^toh 5' flanking sequences linked downstream of the two ^anvariable ^^^Z SwX 
21 (see Table 21 The pVP insert is isolated by partial digestion with Sail and complete d.gestion w.th Xhol, ^'ow* 
LKJuI the 1 5 kb fragment on an agarose gel. The insert is then cloned into the Xhol site of pye2 to generate the 
*SEL ^ P vSuS^ P^ElLtain; two human heavy chain variable gene segments upstream of*. 
S«S^ and abated swHch region. A unique Sail site between me 

c^n be used to clone in D, J, and n gene segments. The rat heavy chain 3' enhancer >s tanked to the 3 end of the T 1 
gene and the entire insert is flanked by Notl sites. 

5.pHC2 

The olasmid clone pVGE1 is digested with Sail and the Xhol insert of pIGMI is cloned into ti The resulting clone, 
PhS^SSSL 4 functional human variaWe region segments, at least^ B human ° «^ ™ 
sWments the human J-m enhancer, the human ^ element, the human u swrtch region all of the ^ » 
I^rTS hurranTu element and the human Y 1 constant region, including the associated switch region and sterile 

^SXjSSSmZ ** linked to the rat heavy chain v enhancer, such that all of the sequence elemente oan be 
HSl ^onlsSefragment away from vector sequences, by digestion wM 
SSudeitogenLetrar^enicanirmfe 

San vaSgene segments to further expand the recombinational diversity of this heavy cha.n m.n,locus. 
F Transgenic mice 

The Notl inserts of plasirMs pIGMI and pHC1 were isolated from vector sequences by agarose gel electrophore- 
sis. S ^Stel P were microinjected into the pronuclei of ^I^BUe x CBA)F2 mouse err^ a* 
transferred the surviving embryos into pseudopregnant females as descnbed by Hogan et al JJ^ 0 ™": 
andE Lacv Methods of Manipulating the Mouse Embryo, 1986. Cold Spring Harbor Laboratory. New Ybrk). Mice that 
Kn^Si we^ analyzed for the presence of transgene sequences by Southern blot ana.ys.s of 

SesS fcESKJjK to 8 weeks of age. serum was isolated from these animals and as^ed for the presence* 
torLene eroded human IgM and lgG1 by ELISA as described by Harlow and Lane (E. Harlow and D. Lane. Arrtibod- 
STl^S MaZl, 1988. Odd Spring Harbor Laboratory. New York). Microliter plate <^"£ 
ZiHSSSL antibodies specific for human IgM (clone AF6. =0285. AMAC, Inc. Westbrook. ME) and human IgGI 
^oneJL5T=0280 AMAC, Inc. Westbrook, ME) . Serum samples were serially diluted into the wells and the presence 

with affinfty isolated alkaline phosphatase conjured goat g (poly- 

valent) that had been pre-adsorbed to minimize cross-reactivity with mouse .mmunoglobulins. Fig. 33 shows *e results 
of aTE^A^^r the^elence of human IgM and lgG1 in the serum of two animals that developed from ^embryos 
5SS !!£5£^ « of plasmid pHCl . One of the animals (#1 8) was negative for the transgene by South- 
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em blcrt analysis, and shewed no detectable levels of human IgM or lgG1 . The second animal (#38) contained approx- 
TL< ^ ,eS the tranSgene ' 33 b V Southern Wotting, and showed detectable levels of both human IgM 

TeSU J?* E , L,SA aSSayS for 1 1 developed from transgene injected embryos is summarized 

tn the table below (Table 3). 



Table 3 
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Detection of h 


uman IgM and lgG1 in the serum of transgenic animals by ELISA assay 


OJUIIIGU TT I^WI 1 


injected transgene 


approximate transgene 


human IgM 


human 


6 


plGM1 


1 


++ 




7 


plGM1 


0 






9 


plGM1 


0 






10 


plGM1 


0 






12 


plGM1 


0 






15 . 


plGM1 


10 


++ 




18 


pHCl 


0 






19 


PHC1 


1 






21 


pHC1 


<1 






26 


pHC1 


2 


++ 


+ 


38 


pHC1 


5 


++ 


+ 



Table 3 shows a correlation between the presence of integrated transgene DNA and the presence of transgene 
encoded immunoglobulins in the serum. Two of the animals that were found to contain the pHC1 transgene did not 
express detectable levels of human immunoglobulins. These were both tow copy animals and may not have contained 
complete cop.es of the transgenes, or the animals may have been genetic mosaics (indicated by the <1 copy per cell 
estimated for animal #21). and the transgene containing cells may not have populated the hematopoetic lineage Alter- 
natively, the transgenes may have integrated into genomic locations that are not conducive to their expression The 
detection of human IgM in the serum of pIGMI transgenics, and human IgM and IgGl in P HC1 transgenics, indicates 
that the transgene sequences function correctly in directing VDJ joining, transcription, and isotype switching. 

EXAMPLE 1S 

Rearranged Heavy Chain Transgenes 

A. Isolation of Rearn^flftfl tfl |ffnan Heavy Ch ain VDJ senmantc , 

Two human leukocyte genomic DNA libraries cloned into the phage vector IEMBL3/SP6/17 (Qonetech Laborato- 
ries^ Inc.. Palo Alto. CA) are screened with a 1 kb Pacl/Hindlll fragment of X1.3 containing the human heavy chain J-u 
■rtromc enhancer. Positive clones are tested for hybridization with a mixture of the following V H specific oligonucle- 



so oligo-7 5 '-tea gtg aag gtt tec tgc aag gca tct gga tac acc ttc 
acc-3 ' 



oligo-8 5 '-tec ctg aga etc tec tgt gca gec tct gga ttc acc ttc 
agt-3 ' 
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Clones that hybridized with both V and Jn probes are isolated and the DNA sequence of the rearranged VDJ seg- 
ment determined. 

R Construction nf rearranoe H human heavy chain transaeneS 

Fragments containing functional VJ segments (open reading frame and splice signals) are subdoned into the plas- 
mid vector pSP72 such that the plasmid derived Xhol site is adjacent to the S end of the .nsert sequence. A subdone 
contaNng a functional VDJ segment is digested with Xhol and Pad (Pad, a rare-cutting enzyme, reco ^ ni ^^^f ^"^.^ 
*e J-m intronic enhancer), and the insert doned into Xhol/Pacl digested P HC2 to generate a ^ en « ro "^ w * 
„ a fundiona. VDJ segment, the J-n intronic enhance, the , switch element, the M ^nstarrt 

v1 constant reoion including the sterile transcript assodated sequences, the r 1 switch, and the coding exons. Tnis 

animals as described above. 
15 EXAMPLE 16 

i if| i7t fthain Transoenes 
A Construction of P lasmid vectors 
1 . Plasmid vector pGP1c 

Plasmid vector pGPl a is digested with Notl and the following oligonucleotides ligated in: 
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oligo-81 5'-ggc cgc ate ccg "ggt etc gag gtc gac aag ctt teg agg 
ate cgc-3' 

oligo-82 5'-ggc cgc gga tec teg aaa get tgt cga cct cga gac ccg 
gga tgc-3 1 



The resulting plasmid. pGP1c, contains a polylinker with Xmal, Xhol. Sail, Hindlll, and BamHI restriction sites flanked 
by Notl sites. 

40 2. Plasmid vector pGP 1 d 

Plasmid vector pGP1a is digested with Notl and the following oligonucleotides ligated in: 



45 



oiigo-87 S'-ggc cgc 
-3' 



tgt cga caa get tat cga tgg ate etc gag tgc 



oligo-88 5'-ggc cgc act cga gga tec ate gat aag ctt gtc gac age 

50 -3 ' 

The resulting plasmid. pGP1d, contains a polylinker with Sail, Hindlll. Clal, BamHI. and Xhol restriction sites flanked by 
55 Notl sites. 

r i llation of Jx: and Ck clones 

A human placental genomic DNA library cloned into the phage vector XEMBL3/SP6/T7 (Clonetech Laboratories. 
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Inc.. Palo Alto. CA) was screened with the human Kappa light chain J region specific oligonucleotide: 

oligo-36 5«- cac ctt egg cca agg gac acg act gga gat taa acg 
taa gca -3 • 

™* ^ e p ^\ d °"f 1 5 2 *« 1365 iso,ated - A 7.4 kb Xhol fragment that includes the M segment was isolated 
11 f- a ? d J Eub ? loned ,nt0 tne P ,asmid P NN03 to Senerate the piasmid clone p36.2. A neighboring 13 kb Xhol frag- 
ment that mcludes Jk segments 2 through 5 together with the Gc gene segment was isolated from phage done 136 5 
and subdoned Unto the piasmid pNNOS to generate the piasmid done p36.5. Together these two dones span the region 
beginning 7.2 kb upstream of Jk1 and ending 9 kb downstream of Ck. 

C. Construction nt rearranged linht rA ain fransaenag 

1 . pCK1 , a Ck vector for expressing rearranged variable segments 

■ ^ Th l 1 i'*i (h ° l insert ofplasmid cl °nep36.5 containing the Cxgene. together with 9 kb of downstream sequences. 
?J^™!Z?l™ ^ * v **'P QP1c s ' end * ir «ert adjacent to the piasmid Xhol site. The 

resutting done, pCKI can accept doned fragments containing rearranged VJk segments into the unique 5 1 Xhol site 
The transgene can then be exdsed with Noll and purified from vector sequences by gel electrophoresis. The resulting 
transgene construct will contain the human J-Ck intronic enhancer and may contain the human 3' k enhancer. 

2. pCK2, a Ck vector with heavy chain enhancers for expressing rearranged variable segments 

1 A ,? n t^^fi^ a9mert a m0USe 9 ® nomic DNA confining the mouse heavy chain J-u intronic enhancer (J. Banerji 

Sum Hi ^L"!!!!:! i' led in Wen0VV enZyme " 1)16 Wenow treated DNA was »n en ***** Hindlll and a 1 .4 
1 ^ ?r° W) T?u * agment 1 of Dha 9 e to™ * 1 - 3 6»»*U8 example), containing the human heavy chain J-u 
mtronic enhancer (A. Hayday et al. (1984) Nature 307. 334-340). to it. The resulting piasmid. pMHEI. consists of thS 

TT ^S^iT* J " M intr0niC enhancers "W** 1 toa^ ir *o pUC18 such that they are exdsed on a sin- 
gle BamHI/Hindlll fragment This 2.3 kb fragment is isolated and cloned into pGP1c to generate pMHE2 pMHE2 is 
digested witt. Sail and the 13 kb Xhol insert of p36.5 doned in. The resulting piasmid. pCK2. is identical to pCK1 
f *• mouse and numan heavy chain J-n intronic enhancers are fused to the 3' end of the transgene insert' 
to modulate expression of thefinal transgene, analogous constructs can be generated with different enhancers, i e the 
mouse or rat3 kappa or heavy chain enhancer (K. Meyer and M.S. Neuberger. (1989) EMBOJ., 8, 1959-1964' S Pet- 
terson, et al. (1990) Nature. 344. 165-168). 1 

2. Isolation of rearranged kappa light chain variable segments 

Two humanjeukocyte genomic DNA libraries doned into the phage vector XEMBL3/SP6/T7 (Clonetech Laborato- 
° } Were SCTeened ^ human kappa light chain J region containing 3.5 kb Xhol/Smal frag- 
ment of p36.5. Positive dones were tested for hybridization with the following Vk specific oligonudeotide: 

oligo-65 5 '-agg ttc agt ggc agt ggg tct ggg aca gac ttc act etc 
acc ate agc-3 ' 



AtartM hybridiMd bolh V and J probes are isola,ed ** toe DNA sequence of the rearranged VJk segment 
3. Generation of transgenic mice containing rearranged human light chain constructs. 

Fragments containing functional VJ segments (open reading frame and splice signals) are subdoned into the 
un^ue Xhol srt^of vectors pCKI and pCK2 to senerate rearranged kappa tight^ialn trinsgenes. ^^^6^ 
structs are isolated from vector sequences by digestion with Notl. Agarose gel purified insert is micoinjected into mouse 
embryo pronuclei to generate transgenic animate. Animals expressing human kappa chain are bred with heavy chain 
minflocus containing transgenic animals (EXAMPLE 14) to generate mice expressing fully human antibodies 
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Because not all VJk combinations may be capable of forming stable heavy-light chain complexes with a broad 
soectrum of different heavy chain VDJ combinations, several different light chain transgene instructs are generated, 
Slg.M reaLged VJk done, and transgenic mice that result from these constructs are bred w.tf .heavy 
chain minSocus transgene expressing mice. Peripheral blood, spleen, and lymph node lymphocyteswe isolated from 
TZ TSenic (blth heZ and Tight chain constructs) animals, stained with fluorescent anbbod.« > specrfc for 
human and mouse heavy and light chain immunoglobulins (Pharmingen, San Diego, CA) and analyzed by f^ cytom- 
easing a FACScan aLlyzer (Becton Dickinson. San Jose. CA). Rearranged ligh t « *ain Jansgenes 
Sutt in L highest level of human height chain complexes on the ^ce of the h ig hest number rfB^ 
n^adverselyaffectthe immune cell compartment (as assayed by flow cytometric analys.swrth Band T cell subset spe- 
cific antibodies), are selected for the generation of human monoclonal antibodies. 

n r^n ytmrtion of unrearw noed light chain minilocus fransgenes 

1. pJCK1, a Jk, Ck containing vector for constructing minilocus transgenes 

The 13 kb Ck containing Xhol insert of p36.5 is treated with Wenow enzyme and cloned into Hindlll 
now treated, plasma pGP1d. A plasmid clone is selected such that the 5' end of the .nsert is adjacentto the vector 

CM sfte. The Suiting plasmid. P 36.5-1d. isdigested with Clal and 
Xhol insert of P 36.2 is then Wenow treated and cloned into the Clal. Wenow treated p36.5-1d. A clone s selected in 
wSch ^ the p?6.2Ynsert is in the same orientation as the P 36.5 insert. This clone. pJCK1 (Fig. 34). contains the entre 
tZn Son and Ck. together with 7.2 kb of upstream sequences and 9 kb of downstream ^"ces^Theinserl 
ateo contain? the human J-Ck intronic enhancer and may contain a human 3' k enhancer. The mse t « [^t*a 
urtqueS' Sail ste for the purpose of cloning addrtional 3" flanWng sequences such as heavy chain or l,gMcha,n enhanc- 
enj A unique Xhol site is located at the 5' end of the insert for the purpose of cloning in prearranged Vk gene seg- 
ments S 

construct away from vector sequences. 

2. Isolation of unrearranged Vk gene segments and generation of transgenic animals expressing human Ig light chain 
protein 

The Vk specific oligonucleotide, oligo-65 (discussed above), is used to probe a human placental genomic DNA 
libra^ ^ EMBL3/SP6/T7 (Clonetech Laboratories, Inc.. Palo Mo, CA). ^ g>n» 

^£3^^ clones are sequenced, and dones that appear functional are 
function*^ include: open reading frames, intact splice acceptor and donor s^uences, and ^ 
sequence DNA fragments containing selected variable gene segments are cbned ,rrto the unique 
SSki to generate minilocus constructs. The resulting clones are digested wrth Notl and the inserts ^'^d 
Kerti into mouse embryo pronuclei to generate transgenic animals. The transgenes of these ammals will unde*o V 
lo '!!?o^ Animals expressing human kappa chan are bred with heavy cha.n m.n.locus contan- 

ing transgenic animals (EXAMPLE 14) to generate mice expressing fully human antibodies. 

40 

EXAMPLE 17 

gynthfl fo Hoavv Chain Variable Region 
45 This example is outlined in Fig. 35. 
A finngtrnfrtinn of Cloning Vector oVHf 
1.pGP1f 
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The plasmid pGP1a (previous example) is digested with Notl and the following oligonucleotides are ligated to it: 

oligo-«a«5«-ggc cgc atg eta etc gag tgc aag ctt ggc cat cca-3' 
oligo-«b«5'-ggc ctg gat ggc caa get tgc act cga gta gca tgc-3' 

The resulting plasmid. pGP1f. contains Sphl, Xhol. and Hindlll sites flanked by Notl and Sfil sites. 
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The human V H -V family variable gene segment V H 251 (C.G. Hurrphries et al. (1988) Nature. 321 446) together 

^ifp?n.*T f *9 mem fr° m *8 Piasrmd done pVH251 (previous exanple) and cloned into the plasmid vector 
ESI v ST*" k I!!' 1, Wl) " 71,6 5 ' ***** 6 «^"** together with the promoter, first axon anJS 

■rtron of V H 251 , are amplrfied by polymerase chain reaction (PCR) from this template using the following oligonude- 

oligo-83 5'-cag etc gag etc ggc aca ggc gec tgt ggg-3* 
oligo-84 5' -etc tag agt cga cct gca ggc-3 ' 
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The 3* flanking sequences are amplified by PCR using the following oligonucleotides: 

oligo-85 5 '-age etc gag ccc gtc taa aac cct cca cac-2< 
oligo-86 5*-ggt gac act ata gaa tac tea agc-3 ■ 

The amplified 5' sequences are digested with Sph! and Xhol. and the 3' sequences digested with Hindlll and Xhol The 

rr»5nn 9m f f "2 *"* int0 ** ****** ^ to 9 enerate P ,asmid P VHf - PVHf contains the 

ZZ^Stf* 5 V ^ M of V H 251, together with the signal sequence encoding first 

SS£?h5 3 ? ^ for heavy Chain variab,e sequences. Such sequences are cloned into the 

Kasl/Xhol digested plasmid as described below. 

B. isolation of Variable Genp flodino Seq ucn^ 

1 . Amplification of expressed V H gene cDNA sequences 

nJ^JSLS^ iS iM ) at ^ ro 7jL urT ^ P«fP»«™" **** lymphocytes (PBL). First strand cDNA is synthesized with 
^ ^ T 9 ° l,9 °" (dT) aS a primer - 7)16 ,irst « cDNA is tootetod and tailed with oligo (dG) using ter- 
rnmal transferase The 5* sequences of IgM transcripts are then specifically arrplified by a modification of the method 

225? JT* ^ ^-AcdSa llSA fi& 8998). OligoWa anSthefolLing olSSiS 
oiigo-69 5 -gga att etc aca gga gac gag-3' 

rqT^I' T 3 'r mefS ' res P ective, y- in a Polymerase chain reaction with dG-tafled first strand PBL cDNA. Oligo- 

JiISESJK* !^1! nCeS 8nCOdin9 ami "° addS 1 1-1 7 01 ,he l9M constent domain - ^ nere *° r 6 these primers 
will amplify DNA fragments of approximately 0.6 kb that include expressed V H gene sequences. 

2. Back-conversion of cDNA sequences into germline form 

The following oligonucleotide: 

oligo-"c"5'-ctg acg act ctg tat ggc gec (ct)a(cg) t(cg) (ct) 
(cg)ag (ag)t(cg) ca(ag) ct(gt) gtg (cg)a(ag) tc(gt) 

gg(gt)-3.« 



« annealedto denatured. PCR amplified. IgM 5' sequences. Oligo-'c" includes a 21 nucleotide nondegenerate 

SPSS ^ 3 KaS ' 8ite - fo,,0wed by a 30 nuc,eoMe generate sequence that is homologous to SeTrt 
of the second axon of many human V„ segments (Genbank; Los Alamos. NM). The primer is extended with DNA 

^r^Z * ? iS °!! ,ed fr ° m unused primer * 8126 donation. The product is then denatured and 

annealed to the following oligonucleotide: 
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oligo-"d" 5'-ggg etc gag get ggt ttc tct cac tgt gtg t(cgt)t 
(aegt) (ag) (ct) aca gta ata ca(ct) (ag)g(ct)-3' 

Oliao-"d" includes a 30 nucleotide nondegenerate sequence that includes an Xhol site and part of the V to DJ reconv 

amino acids in framework region three of many human variable gene segmente. The annealed 
X^!^^ tended with DNA polymerase and the product isolated from unused pnmer 
tion^higle rounds of DNA synthesis followed by removal of primers are carried out to ensure ^®^ uwc ® ^ 

gene fragments. The product of oligo-U primer extension is amplrtied by PCR using the following 

two oligonucleotides as primers: 

oligo-"e" 5 '-ctg acg act ctg tat ggc gcc-3 • 

oligo-"f" 5*-ggg etc gag get ggt ttc tct-3 • 

The resulting0.36kbPCR product is purified by gel electrophoresis and digested ^ 

Z D?gesL products are then cloned into Kasl/Xhol digested P VHf to generate a horary ol ^^^ 'f^ 
seouences in germline configuration. Ligation into the Kasl site of pVHf recreates the spl.ee acceptor srte at the 5 end 

tS^SSSm versions of degenerate oligonucleotides V and "d" are used to amplify different poputeftons 
Scenes, and generate germline-configuration .ibraries representing those different populations (Genbank; Los 
Alamos, NN). 

r. r^n ^iirrtion of Synthetic Locus 

The entire library of synthetic germline-configuration V H genes is grown up together and plasmid DNA isolated. The 
medium copy tfasmkJ pVHf. which includes a strong transcription terminator between the ampicillin res^nce gene 
aSL ctoning site, is designed to minirrtze the expansion of partteular clones w,th,n 

digested with ail, treated with calf intestinal phosphatase to remove S phosphate groups, and then d.gested wrth NotL 
S Sf inTestina phosphatase is removed prior to Notl digestion so that only the Sfil ends are dephosphorylated. The 
^^^^*mS*M from vector sequences by agarose gel electrophoresis and ligated to the following ol.go- 
nucleotides: 

oligo-«g"5'-ggc eta act gag cgt ccc ata ttg aga acc tec -3« 

oligo-"h" 5 '-ggt tct caa tat ggg acg etc agt ta-3' 

Oligo-"h" is Wnased while oligo-"g" is left unphosphorylated. The ligation reaction is carried out wjh a large molar 
exSss of oligonudeotkles so that all of the V gene fragment Notl ends will be ligated to <**™*£fi^«£ 

5C V region fragments. Because the Sfil ends are not se« compatible, the V segments wj concatenate ■ « the same orien- 
tation such fret each V segment is separated by a single oligonucleotide spacer urat from the next V segment. 

^eZ^rs are sized byTectrophoreste and isolated from agarose gels. The size >MnM «nc*ffh 
ers mtt^dtaSy ejected into mouse embryo pronuclei together with D-J-C containing DNA fragments (suchas 
S £52 pSnserts) to generate transgenic animate with large primary repertoires. Alternately, the concatom- 

55 ers are cloned into a plasmid vector such as pGPf. 
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EXAMPLE 18 

Generation Of Lymphoid Cell Receptor 3.,^ S p ecific Antihoriips 

The inoculation of mice with xenogeneic (i.e. human) immunoglobulins (B-cell receptors) or T-cell receptors leads 
predorrtmantly to the generation of mouse antibodies directed against particular epitopes (dominant epitopes) that 
shared by all or most immunoglobulins or T-cell receptors of a given species, but differ between species. It is therefore 
difficult to isolate antibodies that distinguish particular subsets of B or T cell receptors (e.g.. idiotypes or variable region 
families). However, the transgenic mouse expressing human immunoglobulins (described in the above examples) will 
be immunologically tolerant of those shared B-cell epitopes and will therefore be useful for generating antibodies that 
distinguish subsets of human immunoglobulins. This concept is extended by generating transgenic mice expressing 
human T-cell receptor coding sequences and breeding these mice with the human immunoglobulin transgenic mice 
Such mice are inoculated with isolates containing human T-cell receptor proteins and monoclonal antibodies are gen- 
erated that recognize T-cell receptor subsets. 

Studies have demonstrated that there is a limited variability of T cell antigen receptors involved in certain autoim- 
mune diseases (T.F. Daviesetal. (1991) New England J, Mori 325,238). Because of this limited variability, it is possi- 
ble to generate human monoclonal antibodies that specifically recognize that subset of human T cells which is auto- 
reactive. 

A. Generation of B-cell s ubset specific antibodies 

Human immunoglobulin expressing transgenic mice are inoculated with immunoglobulins isolated from a healthy 
^v 0 ^? 8 Pat,ert 3 B " Cel1 mali 0 nanc y expressing a high level of a single immunoglobulin type (Miller et al. 
(1982) New Eng. J. Med. 306. 517-522). Monoclonal antibody secreting hybridomas are generated as described by 
Harlow and (Lane , (E. Harlow and D. Lane. Antibodies: A Laboratory Manual. 1988. Cold Spring Harbor Laboratory, New 
York). Individual hybridomas that secrete human antibodies that specif icaJly recognize B-cell subsets are selected. 

B. Transgenic mice expressing human T-firi l receptor seqnprmoc 

DNA fragments containing intact and fully rearranged human T-cell receptor (TCR) a and 8 genes are coiniected 
into mouse embryo pronuclei to generate transgenic animals. Transgenic animals are assayed by FACS analysis for the 
expression of both transgenes on the surface of their Tcells. Animals are selected that express only low levels of the 
human a and p TCR chains on a fraction of their T-cells. Only low level expression is required to obtain immunological 
tolerance, and high level expression will distuito the arimars immune system and interfere with the ability to mount an 
immune response required for the generation monoclonal antibodies. Alternatively, because correct tissue or cell type 
specific expression is not required to obtain immunologic tolerance, TCR a and p chain cDNA clones are inserted into 
transgene expression cassettes (T. Choi et al. (1 991) Mol. Cell. Biol u, 3070-3074) under the control of non-TCR tran- 
scnption signals. TCR a and p chain CDNA transgene constructs are coinjected into mouse embryo pronuclei to gen- 
erate transgenic animals. Ectopic expression of the TCR chains will not result in cell surface expression because the 
TCR is a multichain complex (H. Clevers et al. 1988 Ann. Rev. Immunol., fi, 629-662); however, cell surface expression 
is not required for antigen presentation (Townsend et al. (1986) Nature. 324, 575-577) and tolerance induction 

T-cell receptor a and p chain transgenic mice are bred with human immunoglobulin expressing transgenic mice to 
generate mice that are useful for generating human monoclonal antibodies that recognize specific subsets of human T- 
cells. Such mice are inoculated with T-cell derived proteins isolated from a healthy donor or from a patient with a T-cell 
malignancy expressing a single TCR type. Monoclonal antibody secreting hybridomas are generated and individual 
hybndomas that secrete human antibodies that specifically recognize B-cell subsets are selected. 

EXAMPLE 19 

Genomic Heavy Chain Human lo Transgene 

This Example describes the cloning of a human genomic heavy chain immunoglobulin transgene which is then 
introduced into the murine germline via microinjection into zygotes or integration in ES cells. 

Nuclei are isolated from fresh human placental tissue asdescribed by Marzluff. W.F.. et ai. ( 1 985). Transcription and 
Translation: A Practical Approach , B.D. Hammes and S.J. Higgins. eds.. pp. 89-129. IRL Press. Oxford). The isolated 
nuclei (or PBS washed human spermatocytes) are embedded in 0.5% low melting point agarose blocks and lysed with 
1 mg/ml proteinase K in 500mM EDTA, 1% SDS for nuclei, or with 1mg/ml proteinase K in 500mM EDTA, 1% SDS 
1™ D ^«o/ spermatocytes at 50°C for 18 hours. The proteinase K is inactivated by incubating the blocks in 40ug/rrd 
pmsf in TE for 30 minutes at 50°C, and then washing extensively with TE. The DNA is then digested in the agarose 
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with the restriction enzyme Notl as described by M. Rnney in Current Protocols in Mplecglar Biology (F. Ausubel et sl- 
eds. John Wiley & Sons, Supp. 4, 1 988, e.g., Section 2.5.1 ). 

The Notl digested DNA is then fractionated by pulsed Held gel electrophoresis as described by Anand. R. et al. 
(1989) Adds Res. . 1Z, 3425-3433. Fractions enriched for the Notl fragment are assayed by Southern hybridiza- 
tion to detect one or more of the sequences encoded by this fragment. Such sequences include the heavy chain D seg- 
ments J segments, and 7 1 constant regions together with representatives of all 6 V H families (although *.s fragment is 
identified as 670 kb fragment from HeLa cells by Berman et al. (1988). supra., we have found it to be an 830 to frag- 
ment from human placental and sperm DNA). Those fractions containing this Notl fragment (see Fig. 4) are .gated into 
the Notl cloning site of the vector pYACNN as described (McCormick. M. et al. (1990). Scfjnjgiig 2, 65-71). P asmid 
pYACNN is prepared by digestion of pYACneo (Clontech) with EcoRI and ligation in the presence of the oligonucleotide 

5' AAF TGiC G^GC CGC ~ 3' 

YAC clones containing the heavy chain Notl fragment are isolated as described by Traver et al. (1989). Proc, NatJ 
Acad Sci USA §6 5898-5902. The cloned Notl insert is isolated from high molecular weight yeast DNA by pulse field 
gel eiectrophoresis'as described by M. Finney, op. crt. The DNA is condensed by the addition of 1 mM I spermine! ind 
microinjected directly into the nucleus of single cell embryos previously described^ Alternatrvely the DNA is isolated £y 
pulsed field gel electrophoresis and introduced into ES cells by lipofection (Qnirke et al. (1991). EMBQJ-. 1SL 1629- 
1 634), or the YAC is introduced into ES cells by spheroplast fusion. 

EXAMPLE 20 

Discontinuous Genomic Heavy Cha in la Transaene 

An 85 kg Spel fragment of human genomic DNA, containing V H 6, D segments, J segments, the n constant region 
and part of the y constant region (see Fig. 4). has been isolated by YAC Cloning essentially as described in ^Exampte r 
A YAC carrying a fragment from the germline variable region, such as a 570 kb Notl fragment upstream of the 670-830 
kb Notl fragment described above containing multiple copies of V, through V 5 , is isolated as described. (Berman et al. 
(1988). supra, detected two 570 kb Notl fragments, each containing multiple V segments.) The two fragments are coin- 
iected into the nucleus of a mouse single cell embryo as described in Example 1 . 

Typically, coinjection of two different DNA fragments result in the integration of both fragments at the same insertion 
site within the chromosome. Therefore, approximately 50% of the resulting transgenic animals that contain at least one 
copy of each of the two fragments will have the V segment fragment inserted upstream of the constant region containing 
fragment Of these animals, about 50% will carry out V to DJ joining by DNA inversion and about 50% by deletion, 
depending on the orientation of the 570 kb Notl fragment relative to the position of the 85 kb Spel fragment. DNA is iso- 
lated from resultant transgenic animals and those animals found to be containing both transgenes by Southern blot 
hybridization (specifically, those animals containing both multiple human V segments and human constant region 
genes) are tested for their ability to express human immunoglobulin molecules in accordance with standard techniques. 

EXAMPLE 21 

Joining Overl apping YAC Fragments 

Two YACs carrying a region of overlap are joined in yeast by meiotic recombination as described by Silverman et 
al (199 o) Pmr Natl Acad. Sd. USA. SI 9913-9917, to derive a single, large YAC carrying sequences from both 
smaller YACs. Thetwo YACs are aligned with respect to the arms, such that the joined YAC will contain one centromenc 
vector arm and one non-centromeric vector arm. If necessary, the insert is redoned in the vector using unique restric- 
tion sites at the ends of the insert. If the insert is not a unique restriction fragment, unique sites are inserted into the 
vector arms by oligonucleotide transformation of yeast, as described by Guthrie and Fink, op. at. To join YACs rarrying 
noncontiguous sequences which do not overlap, an overlap is created as follows. The 3' terminal region of the 5 YAC 
and the 5' terminal region of the 3' YAC are subcloned. joined in vitro to create a junction fragment and reintroduced 
into one or both YACs by homologous recombination (Guthrie and Fink, op crt). The two YACs are then meiotically 
recombined as described by Silverman et al.. op cit). The joined YAC is introduced into mice. e.g., as in Example 1 . 

EXAMPLE 22 

; Genomic k Light Chain Hu man la Transaene 

A map of the human k light chain has been described in Lorenz. W. et al. (1987). Nucl. Acids Res. , 15. 9667-9677 
and is depicted in Fig. 1 1 . A 450 kb Xhol to Notl fragment that includes all of Ck, the 3' enhancer, all J segments and 
at least five different V segments (a), or a 750kb Mlul to Notl fragment that includes all of the above plus at least 20 
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more V segments (b) is isolated and introduced into zygotes or ES cells as described in Example 1 . 
EXAMPLE 23 

Genomic k Light Chain Human lo Trartsn a ne Formed bv In Vivo Homologous Recombination 

The 750kb Mlul to Notl fragment is digested with BssHII to produce a fragment of about 400 kb (c). The 450 kb Xhol 
to Notl fragment (a) plus the approximately 400 kb Mlul to BssHII fragment (c) have sequence overlap defined by the 
BssHII and Xhol restriction sites shown in Fig. 1 1 . Homologous recombination of these two fragments upon microinjec- 
tion of a mouse zygote results in a transgene containing at least an additional 15-20 V segments over that found in the 
450 kb Xhol/Notl fragment (Example 22). 



EXAMPLE 24 

,5 Wenttf'Cation of functionally rearranged variable region spgu e nces in transgenic B cells 

u ^ * int6reSt iS USed t0 immunize ( see Harlow and Lane. Antibodies: A Laboratory Manual Cold Spring 

Harbor, New Nbrk (1 988)) a mouse with the following genetic traits: homozygosity at the endogenous having chain locus 
for a deletion of J H (Examples 9 and 12); hemizygous for a single copy of unrearranged human heavy chain minilocus 
transgene (examples 5 and 14); and hemizygous for a single copy of a rearranged human kappa light chain transgene 
(Examples 7 and 16). 

Following the schedule of immunization, the spleen is removed, and spleen cells used to generate hydridomas 
Cells from an individual hybridoma clone that secretes antibodies reactive with the antigen of interest are used to pre- 
pare genomic DNA. A sample of the genomic DNA is digested with several different restriction enzymes that recognize 
unique six base pair sequences, and fractionated on an agarose gel. Southern blot hybridization is used to identify two 
DNA fragments in the 2-10 kb range, one of which contains the single copy of the rearranged human heavy chain VDJ 
sequences and one of which contains the single copy of the rearranged human light chain VJ sequence. These two 
fragments are size fractionated on agarose gel and cloned directly into pUC18. The cloned inserts are then subdoned 
respectively into heavy and light chain expression cassettes that contain constant region sequences. 

The plasmid clone pyel (Example 14) is used as a heavy chain expression cassette and rearranged VDJ 
sequences are cloned into the Xhol site. The plasmid clone pCK1 is used as a light chain expression cassette and rear- 
ranged VJ sequences are cloned into the Xhol site. The resulting clones are used together to transf ect SP 0 cells to pro- 
duce antibodies that react with the antigen of interest (M.S. Co. et al. (1991) Proc. Natl. Acad. Sci USA 88:2869). 

Alternatively, mRNA is isolated from the cloned hybridoma cells described above, and used to synthesize cDNA. 
The expressed human heavy and light chain VDJ and VJ sequence are then amplified by PCR and cloned (J W Larrich 
et ai. (1989) Biol. Technology, Z:934-938). After the nucleotide sequence of these clones has been determined oligo- 
nucleotides are synthesized that encode the same polypeptides, and synthetic expression vectors generated as 
described by C. Queen et al. (1989) Prop. Ngtl. Acqd. Sci. U,SA 84:5454-5458. From the above, it will be apparent that 
various aspects of the invention can be clearly identified. These include: 

(a) a transgenic mouse having a genome comprising a human immunoglobulin heavy chain minilocus transgene 
the transgene being capable of rearrangement and switching in the mouse, whereby two or more rearranged 
human heavy chain isotypes are produced; 

(b) a transgenic mouse having a genome including a transgene comprising DNA sequences that encode human 
variable (V), diversity (D), joining (J) and constant regions of a human Ig protein, which sequences are operably 
linked to transcription regulatory sequences and capable of undergoing gene rearrangement in vivo, when inte- 
grated in a non-human transgenic animal, to produce a rearranged gene encoding a human heavy chain polypep- 
tide, said construct also comprising a switch donor region 5* from a p constant region and a human y switch 
acceptor region between the u constant region and a human T constant region, said switch sequences being oper- 
ably linked to effect switching in vivo and the production of human y heavy chain polypeptide; and 

(c) an immunoglobulin (Ig) heavy chain minilocus transgene construct comprising DNA sequences that encode 
human variable (V), diversity (D), joining (J) and constant regions of a human Ig protein, which sequences are oper- 
ably finked to traru»cnption regulatory sequences and capable of undergoing gene rearrangement in vivo, when 
integrated in a non-human transgenic animal, to produce a rearranged gene encoding a human heavy chain 
polypeptide, said construct also comprising a mu switch donor region 5' from a n constant region and a human y 
switch acceptor region between the n constant region and a human y constant region, said switch sequences being 
operably linked to effect switching in vivo and the production of human y heavy chain polypeptide. 
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The foregoing description of the preferred embodiments of the present invention has been presented for purposes 
of illustration and description. They are not intended to be exhaustive or to limit the invention to the precise form dis- 
closed, and many modifications and variations are possible in light of the above teaching. 

All publications and patent applications herein are incorporated by reference to the same extent as if each individ- 
ual publication or patent application was specifically and individually indicated to be incorporated by reference. 

Such modifications and variations which may be apparent to a person skilled in the art are intended to be within the 
scope of this invention. 

Claims 

1 A transgenic mouse having a genome comprising a human immunoglobulin heavy chain minilocus transgene, the 
transgene being capable of rearrangement and switching in the mouse, whereby two or more rearranged human 
heavy chain isotypes are produced. 

2. A transgenic mouse of claim 1 , wherein the isotypes are \i and y. 

3 A transgenic mouse having a genome including a transgene comprising DNA sequences that encode human var- 
iable (V) diversity (D), joining (J) and constant regions of a human Ig protein, which sequences are operably linked 
to transcription regulatory sequences and capable of undergoing gene rearrangement in vivo, when integrated in a 
non-human transgenic animal, to produce a rearranged gene encoding a human heavy chain polypeptide, said 
construct also comprising a \i switch donor region 5' from a \i constant region and a human y switch acceptor region 
between the *i constant region and a human y constant region, said switch sequences being operably linked to 
effect switching in vivo and the production of human y heavy chain polypeptide. 

4 An immunoglobulin (Ig) heavy chain minilocus transgene construct comprising DNA sequences that encode 
human variable (V), diversity (D), joining (J) and constant regions of a human Ig protein, which sequences are oper- 
ably linked to transcription regulatory sequences and capable of undergoing gene rearrangement in vivo, when 
integrated in a non-human transgenic animal, to produce a rearranged gene encoding a human heavy chain 
polypeptide said construct also comprising a mu switch donor region 5' from a p. constant region and a human y 
switch acceptor region between the » constant region and a human y constant region, said switch sequences being 
operably linked to effect switching in vivo and the production of human y heavy chain polypeptide. 

5. The use of a construct of claim 4 in producing a transgenic non-human animal capable of the production of human 
y heavy chain polypeptide in response to antigenic challenge. 

6 A process for the production of a transgenic non-human animal capable of the production of human y heavy chain 
polypeptide in response to antigenic challenge, comprising functionally disrupting the endogenous immunoglobulin 
heavy chain locus and inserting into the animal genome a transgene construct of claim 4. 

7. The use of an animal of any one of claims 1 to 3 or an animal obtainable by a process of claim 6 in the production 
of B cells that produce y immunoglobulin having human heavy chain and binding to a selected antigen. 

8 A process for the production of B cells that produce y immunoglobulin having human heavy chain and binding to a 
selected antigen, comprising challenging an animal of any one of claims 1 to 3 or an animal obtainable by a process 
of claim 6 with said antigen and screening for B cells from said animal that bind said antigen. 



9. B cells obtainable by a process of claim 8. 

10. A hybridoma obtainable by immortalizing B cells of claim 9. 

11. a hybridoma obtainable by fusing a B cell of claim 9 with a myeloma cell. 

12. The use of B cells of claim 9 in producing a hybridoma or corresponding monoclonal antibody. 

1 3. A process for producing monoclonal antibody comprising cultivating a hybridoma of claim 10 or claim 1 1 . 

14 A process for the production of y immunoglobulin having human heavy chain and binding to a selected antigen, 
comprising challenging an animal of any one of claims 1 to 3 or an animal obtainable by a process of claim 6 with 
said antigen and obtaining y immunoglobulin therefrom. 



50 



EP0814159A2 



-A 



// i \ \ i// 

FR1 CDR1 FR2 CDR2 FR3 CDR3 FR4 C 

/ 





1 











AAAAAAA 



F/G. 7 



Hoc 

V;. 



'N 



J A C >. 



F/G.2 



850 Kb 



750 Kb 



1 Mb 



t 

Kde 



25 Kb 



FIG. 3 



51 



EP 0 814 159 A2 




52 



EP0 814159A2 



VECTOR 



W 



PROMOTER 

■* i= ^-r 



Y 



2 



PRIMERS 



-a 

P2 ■ 



D- 

Pi 



GENE 



mRNA 









2 


1 







FR1 CDR1 FR2 CDR2 FR3 CDR3 FR4 C 



AAAAAAA 



F/G.5 



53 



EP 0 814 159 A2 



mRNA 



AAAAAAA 



I 



pi 



P1 + REVERSE TRANSCRIPTASE 



-OKI- 



P2 



I 



P2 f DNA POLYMERASE 



■cm- 



PURIFY cDNA 



P3 



P4 



PCR AMPLIFY WITH P3 «• P4 



-I 

PROMOTER _ jgj ^ 



w L * u - 1 



I 



z 



CONSTRUCT V REGION LIBRARY 



Z W Z w /^z 



-ke- 



CONCATENATE LIBRARY 



F/G. 6 



54 



EP0 814159A2 



1 ,Kt> J, CH4 

I — ' J a ~ ^CH3 , 

CH2 



D J ?fl J 4 CH"f II M1M2 

-m- hh — s 

Kpnl Kpnl CHROMOSOME 



HSV tk neo 



neo 

pcr 



TARGETING VECTOR 



Kpnl RECOMBINANT 



FIG. 7 




FIG. 8 



55 



EP 0 814 159 A2 




FIG. 9 



LVDJ E M C t1 E H3 

-U — 0 HI IB — »H HH+-0- 

ii ii : — iL_i 

FRAGMENT 1 2 3 4 

FIG. 10 



56 



EP 0 814 159 A2 



o 
2 



,2 — 
2 c/5 



X 
to 
« 
CD 



TO 

CO _ 











o 


o — 




JO 
X 


JC =r 
X CO 


I i ■ 


I 


-U. 


Ob 


Ab 


u 






• Xho I 



CO 



CO 



I I 



X - 

co O 
CD 2 



B Ji/ d 



K 



450 Ko 



Not ! 



(A) 



Mlu I 
i — 



Mlu ! 
I — 



750 Kb 
BssH I! 



-400 Kb 



Not t 



(B) 



(C) 



F/G. 11 



57 



EP 0 814 159 A2 



5 
cc 

ID 




58 



EP0814 159 A2 



OllGO-1 OLIGO-2 QLIGO-3 

OLIGO-4 OLIGO-5 OUGO-6 



82 Xba I 
76 EcoR V 115 Kpn I 
60 Sty I lllSmal 
60Ncol 94Sphl 127Mlul 

,o o, 55 BamH 1 89 Xh0 1 1 21 HinD til 
^? Sfil 75C!al 106 EcoR I 139 Not I 

40 Not I 65BssHII 100 Bat It 133 Spe? 
J ill II < ill i i "i I i i | 



147 BASE PAIRS 



pGPl POLYLINKER 



FIG. 13 



59 



EP 0 814 159 A2 



pMUM INSERT 



B BgV E E E BH B B 
' ■■■ ; ; im— i :■/■ .'.^y/^ -m*** 2 ! L 



°- s - c~ 



6.3 Kb Bamri i/HmD III 1 0 Kb HmD 111/BamH I 1 .5Kb BamH I 



(A) (B) (C) 

HUMAN u LOCUS 

F/G. 74 



60 



EP0814159A2 




61 



EP0 814 159 A2 



HUMAN C Y i GENE 



HinD III 



Bgl II 
i — 



HinD III 

— ■— a-^ — e — s- 



CH-, H CH 2 CH 2 



Bgl II 
+E — i — 



TM 



5.3 Kb 



(B) 



7.7 Kb 



(A) 



HinD III 



Bam I 



Sfil^- 
Not I 



RAT 3' 
ENHANCER 
4 




pRE3 



,Sph I 
>^Bol II 
— HinD III 
Sfil 
Spe I 
Not I 



Bal ii 
HinD III 



(A) - (B) 



POLYLINKER 




RAT 3' ENHANCER 




Bai II 
Spfil 

K* 1 " 3 ' \ ✓Not I 
ENHANCER \Zsfil 

'Bam I 



HinD III 
Sfil 
Not I 



FIG. 16 



62 



EP 0 814 159 A2 




Not I 



31.5 Kb No I 1/Spe I INSERT 




Not I 

25 Kb Not l/Spe I INSERT 



FIG. 17 



63 



EP 0 814 159 A2 



x I 



X 

« — ; x 
Q 



JO 
X 



CD 



O 

x: 
x 

jo 

CM 

in 



c 

X 



o 

X 



no 



t 

re 1 
JO 
X 



g 
o 

uj 

CC 

o 
< 

X 



< 

o 



o 



to 

X 



O 
J= 
X 



TO 
JO 
X 
XI 

. eg 



CD 
X) 
X 

JO 

CM 

CO 



CO 
CD 



o 



64 



EP0814159A2 




65 



BamH l 
i i HWr 



EP 0 814 159 A2 



Sma ! 

. Sma ' c ma . BamH I BamH I BamH I 



Sma I BamH 



11 Kb BamH 



5 Kb Sma I 16 Kb Sma I 



FIG. 20 




Not I 



FIG. 21 



66 



EP0814159A2 





67 



EP0814159 A2 



MOUSE HEAVY CHAIN LOCUS 



Kpn 



EcoR I E coR 1 



Kpn I 



• ■ i ' 



i 

] | EcoR I 
Sph I Nae I 



FIG. 23a 



68 



EP0814159A2 



pGK Neo 




Klenow EcoR I 



pGK Neo BLUNT 
(A) 

FIG. 23b 



69 



EcoR I 



Not I 
Spn i/Nae I 

i 

! - pGK Neo FROM pGEM7(KJ1) (A) 
i 5 - . AATTCATG - 3' 




FIG. 23c 



70 



EP0 814 159 A2 



EcoR I HinD III 
i ~~ "■ i 

pGKTK 

pMK02 — 




FIG. 23d 



71 



EP0 814159 A2 



MOUSE KAPPA GENE 



EcoRi Boin _ BgMI Bglll 

_ 3_ 



Sph I Hpa 



2.3 Kb 
Ba! 11 



F/G. 24a 



72 



EP 0 814 159 A2 




FIG. 24b 



73 



EP0814159A2 



BssH II 




; BssH 11 DIGEST 

i 

+ 2 Kb Sal I TO EcoR I pGK Neo 
FRAGMENT W/ BssH II LINKER 
ADAPTUS 




FIG. 24c 



74 



EP0814159 A2 



a: 
o 

O 
UJ 

> 
O 



UJ 

O 

< 
X 

o 
I- 

X 

g 
< 



Xba I 



Sac I 
Xho I 



Xba I 
Xbal-j 
Bgl II 



Xba I- 
Bgl II - 

Sac I 

Bgl II 



f 3 



Sac L 
Xbai^ 
Pstl^ 
Bgl ir 



Sph I 

Stul 
EcoRI 
Sph I 
Xba I 
Bal II 



EcoRI _ 



Sac 1/EcoR I 
Sac I 
Xba I 

Bgl II 
Pstl 



Sac i 
Xho I 



Xba I 
Xba I 
Bgl II A 



Sac I ' 
Xho I 



Xbal^ 
Xbal- 
Bgl II 



Xba I - 
Bgl n -: 

Sac i - 

. Bal u - 
Pstl-; 

Pstl.; 
Pstl-; 

EcoRI 
Xho I I 
Xba l I 
Sph l | 



O 

UJ 

z 



Xba i -. 
Bgl II - 

Sac I - 



Bal II -| 
Pstl 

Pstl 
Pstl 
EcoR l_n 

Xhol^ 

Xba 17 

Sph l' 

_Styl = 
EcoRI | 
Xho I 
Xba I I 
Sph I | 



Not I v 
HinD III V 



BamH I 
Cla i 
Xho I 



Xba I 
Xba I 
Bgl II 



Xba I 
Bgl II 

Sac I 



I 



Bal II -! 
Pst It 

Pstl § 
Pstl= 
EcoR |J 
Xho YT 
Xba I/ 
Sph Y 

_ Stu I = 
EcoRI | 
Xho I 
Not) I 



O 

UJ 



Sac I 
Xho I 



Xba I 
Xba I 
Bgl II 



Xba l 



Xba I 
Bgl II 

Sac I - 



Bgl II 
Pst It 

Pstl^ 
EcoR 
Xho VTT 
Xba l/ 
Sph Y 
Stul 
EcoRI 
Sph I - 
Xba I 
JgNI 



EcoRI 1 



Sac 1/EcoR I 
Sac I 
Xba I 

Bgl II 
Pstl 



O 

UJ 



o 
io 



CD 



CO 

o 
o 

— i 

o 

O 

UJ 

O 



CO 



CL 



If) 
CD 

6 

UJ 



o 



CO 

r> 
o 
o 

-J 

CL 
< 

O 
UJ 
K 
UJ 

O 
CC 

< 



UJ 



75 



EP 0 814 159 A2 



BamH I H 



cc 
o 

t— 

o 

UJ 

> 

C 
z 

h- 

UJ 

O 

< 
*- 

< 
x 
o 

> 
< 

Hi 
X 

UJ 

CO 

o 



Hind III - 
Spe I 

EcoR I - 



Xho I 



Nae I 

Hind III 
BamH I 
Hmd III 
BamH I 
Sac l 



JH 4 

JH3 
JH 2 
JH 1 



NCO I 



Stul 
Hind III 

BamH I 
Ncol 

EcoR I 
Hind III 
EcoR I 



Ncol 
EcoR! 
BamH I 



EcoP. I - 



Hind Ml 
Sph 1 
Pst I 
Sal I 
Xba I 
BamH I 
Stu l/Sma I 
EcoRI 



Sal l/Nae I 
BamH I 

Stul 
Spe I ^ 
Xho i EcoR I ^ 



Not I 
Sail 
•Xho I 
Xba I , 
EcoR 



<X3 
CM 

CD 

g: 



5p 
lxjcc 
(/>< 

LUuj 

051 
U10- 



Xba l/Nae I 
Hind III 
BamH I 

Hind III 
BamH I 
Sac I 
EcoRI 



Hind III 
Not I 



w 

=3 
O 
O 



O 
UJ 



o 



o 

2 
Ul 

O 



CO 

o 

Q_ 



o 

c 



CD 



O 



76 



EP 0 814 159 A2 



BamH I -I 



tr 
O 
f- 

o 

> 

o 
h- 

LU 

O 

< 

o 

> 
< 

UJ 

X 

UJ 
CO 

O 
5 



Not I 
Sal i 
Xho i-| 

Stu I 
Hind J li - : 

BamH f - ; 

i 
I 

EcoR I 
Hind III - 
EcoR I - 



Xbal ( 
EcoR IV, 



Hind lllvr 



Not I 
Sail 
EcoR l/Xho I 



Sal l/Nae I -p 
BamH I - 



EcoR l/Xho I 

j 

Stu I - : 
Hmd in - 

BamH i - 



EcoR l - 



Hind (II 



j 



Xba I 



Hind \\\+r 

Notn 

I 

I 



o 

LU 
2 



Hind III 



Hmd III 
Spe I 



Stu T-J 
EcoR I 



Sal l/Nae I 4 
BamH i 



EcoR l/Xho I 



o 



1 


Nco 1 " 




Stu I - 




Hind III - 


v. 


BamH 1 - 




Nco 1 - 


\ 


EcoR 1 - 








Hind III - 


\ 


EcoR 1 - 



Nco I - 



O 

UJ 



ZD 
O 

o 



o 

Q. 



o 



o 

LU 
LU 

o 
< 



LU 



77 



EP 0 814 159 A2 




FIG. 27 
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